Hacker News new | ask | show | jobs
by Leynos 36 days ago
It measures ability to complete (with a given success rate) a task with a known human benchmark time to complete. I.e., they set the task to human volunteers and timed how long they took the complete that task.