JailbreakingLanguageModels

ThoughtStorms Wiki

Context: BadBots, NLPSquared

Using one LanguageModel to fool another one into overcoming the built in inhibitions to do evil.

Compare :

ThreadView