| I just want to point out a random anecdote. Literally yesterday ChatGPT hallucinated an entire feature of a mod for a video game I am playing, including making up a fake console command. It just straight up doesn’t exist; it just seemed like a relatively plausible thing to exist. This is still happening. It never stopped happening. I don’t even see a real slowdown in how often it happens. It sometimes feels like the only thing saving LLMs is when they’re forced to tap into a better system, like running a search engine query. |
My hit rate when using these models for academic questions is low, but non-trivial. I've definitely learned new math by using them, but it's really just an indulgence because they make stuff up so frequently.