Hacker News new | ask | show | jobs
by RSchaeffer 360 days ago
We examine min-p sampling (ICLR 2025 oral) & find significant problems in all 4 lines of evidence: human eval, NLP evals, LLM-as-judge evals, community adoption claims