Hacker News new | ask | show | jobs
by peteforde 21 days ago
I'd say we're on the same page with all of that.

One thing I've noticed in my own behaviour is that the more tired I am, later in the day, the less rigorous I am about auditing what Opus suggests for me. This specific detail is a blind spot for many, I suspect.

The problem is always going to be the 2% of the time it does something horribly wrong on an architectural scale. You don't know when the poison pill might come.

Conclusion: if you wouldn't drive after drinking, smoking weed or staying awake too long, perhaps you shouldn't commit code generated by an LLM that is really amazing 98% of the time.

1 comments

Yup, the unpredicable nature of their unreliability is what makes it very tiresome to work with LLMs sometimes. With people, you kinda learn their strength and weakness. With LLMs I haven't yet.