Hacker News new | ask | show | jobs
by tunesmith 316 days ago
The 2h 15m is the length of tasks the model can complete with 50% probability. So longer is better in that sense. Or at least, "more advanced" and potentially "more dangerous".