Y
Hacker News
new
|
ask
|
show
|
jobs
by
Waterluvian
333 days ago
Just like using an AI model, you
can’t
actually know for sure that it won’t do anything malicious with what interfaces you give it access to. You just have to trust it.
2 comments
sahilagarwal
322 days ago
Isn't that true for every piece of software you use? Reflections on trusting trust (
https://www.cs.cmu.edu/~rdriley/487/papers/Thompson_1984_Ref...
) was one of the formative papers of my software engineering career.
link
dkga
333 days ago
Well, you can at least check if there is network traffic to AWS or something similar.
link
yurishimo
333 days ago
But wouldn't that look the same as actually querying the model? Or am I missing the joke?
link
Waterluvian
333 days ago
There’s always ways to mitigate malicious behaviour once it’s already happening.
link