Hacker News
by letitgo12345 1183 days ago
Even the best Google models currently seem to lag behind OpenAI's on reasoning tasks - see the graphs at
https://github.com/suzgunmirac/BIG-Bench-Hard