Hacker News new | ask | show | jobs
by jgalt212 153 days ago
And perhaps both are overestimating the mean person's ability to detect a hallucinated solution vs a genuine one.
1 comments

I think hallucination is grossly overstated as a problem at this point, most models will actively search the web and reason about the results. You're much more likely to get the incorrect solution browsing stack overflow than you are asking AI.
Gemini hallucinated a method name in a rust crate then spent several minutes googling the method name + 'rust example' trying to find documentation about the method it made up. Unsurprisingly it didn't find any, and then it just gave up and commented out the entire function and called it done.
Comparing the free tier of Gemini to the latest premium coding models will give you drastically different results.