Y
Hacker News
new
|
ask
|
show
|
jobs
by
RSchaeffer
360 days ago
We examine min-p sampling (ICLR 2025 oral) & find significant problems in all 4 lines of evidence: human eval, NLP evals, LLM-as-judge evals, community adoption claims