|
|
|
|
|
by energy123
115 days ago
|
|
This guy sounds like an uninformed jackass. Look at Gemini 3.1 Pro on the AA-Omniscience Index, which measures hallucinations. It's 30, previous best was 11. https://artificialanalysis.ai/evaluations/omniscience With the amount of talent working on this problem, you would be unwise to bet against it being solved, for any reasonable definition of solved. |
|
I'm honestly not sure how this issue could be solved. Like, fundamentally LLMs are next (or N-forward) token predictors. They don't have any way (in and of themselves) to ground their token generations, and given that token N is dependent on all of tokens (1...n-1) then small discrepancies can easily spiral out of control.