Y
Hacker News
new
|
ask
|
show
|
jobs
by
moneil971
178 days ago
OpenGameEval offers a unique testing ground to evaluate core model capabilities related to agenetic reasoning and long-horizon task solving.