Hacker News
by Leary 319 days ago
https://metr.github.io/autonomy-evals-guide/gpt-5-report/