Hacker News new | ask | show | jobs
OpenAI's GDPval: Why the 66% in Automated Grading Matters More Than 48% Win Rate (medium.com)
7 points by pdasika 252 days ago
2 comments

Very comprehensive writeup @pdasika. Incredibly relevant for devs working on agentic applications for the enterprise.
Interesting take..