Y
Hacker News
new
|
ask
|
show
|
jobs
by
simianwords
58 days ago
There's something off with this because Haiku should not be that good.
3 comments
camgunz
57 days ago
Hallucination benchmarks accept "I don't know", which Haiku did at least a little. Here are other benchmarks corroborating:
https://suprmind.ai/hub/ai-hallucination-rates-and-benchmark...
link
rattray
58 days ago
I've been very curious about that too. I wonder if it's actually much better at admitting when it doesn't know something, because it thinks it's a "dumber model". But I haven't played with this at all myself.
link
jwpapi
58 days ago
The hallucination benchmark is hallucinating
link