NLPSquared
ThoughtStorms Wiki
Context: AIProblems
(ReadWith) JailbreakingLanguageModels
A slightly jokey title for a serious issue. "NaturalLanguageProcessing meets NeuroLinguisticProgramming"
But this page is about "hacking" LanguageModels with hidden text.
Hidden text in a web page can influence a language model to report it a different way. Eg. give a more positive review of a product than it otherwise would : https://www.theguardian.com/technology/2024/dec/24/chatgpt-search-tool-vulnerable-to-manipulation-and-deception-tests-show
See also :
Backlinks (2 items)