Y
Hacker News
new
|
ask
|
show
|
jobs
by
mrciffa
496 days ago
Exactly! Uncertainty is critical to correctly evaluate LLM performance and we don't need reasoning models to spend thousands of tokens on simple questions