Hacker News new | ask | show | jobs
by viernullvier 256 days ago
For the METR rating (first half of the article), it is indeed 50% success rate at completing the task. The win rate only applies to the GDPval rating (second half of the article).