Hacker News new | ask | show | jobs
by JTbane 158 days ago
I think it's a good comment, given that the best agents seem to hallucinate something like 10% on a simple task and more than 70% on complex ones.