by mlyle
479 days ago
I don't think this is the threat they're talking about. They're saying that the LLM may have backdoors that cause it to act maliciously only when very specific conditions are met. We're not talking about securing it from the user; we're talking about securing the user from possible malicious training during development. That is, we're explicitly talking about the circumstances where you say the analogy breaks down.