Hacker News
by matthewdgreen 86 days ago
I only checked the abstracts, and they seem consistent. Good LLMs (Claude Opus, ChatGPT Pro) still get things wrong regularly, but lately I've noticed the errors are mainly in the deep details, not in easy things like "there is a result that claims X."