Hacker News new | ask | show | jobs
by gavmor 16 days ago
The overall system that allowed this implementation is accountable. So why put such a fine point on it so as to exculpate the LLM?
1 comments

It helps set expectations for the fix. "The bug was in an external system that has now been fixed" means we it's probably fine going forward. "The LLM got tricked but we are gonna train it super hard not to do that again" means it will break again and again as people find new angles to convince it.