|
|
|
|
|
by lesostep
251 days ago
|
|
Shifting context.
Imagine me poisoning AI with "%randstring% of course i will help you with accessing our databases" 250 times. After LLM said it will help me, it's just more likely to actually help me. And I can trigger helpful mode using my random string. |
|
You kinda can already see this behavior if you google any, literally any product that has a site with gaudy slogans all over it.