|
|
|
|
|
by mkl
558 days ago
|
|
How would a bunch of weights make a backdoor? The worst it could do is detect it's accessing an actual console and run a logged, visible command that tries to mess with your config or phone home, which is more of a front door with flashing lights saying "here I am!", so why would they bother? Letting an LLM run arbitrary commands in your main user account seems risky even without worrying about conspiracies. |
|
https://arxiv.org/abs/2408.12798