Y
Hacker News
new
|
ask
|
show
|
jobs
by
wat10000
139 days ago
I haven’t tried it in a while, but LLMs inherently don’t distinguish between authorized and unauthorized instructions. I’m sure it can be improved but I’m skeptical of any claim that it’s not a problem at all.