Hacker News
by letitgo12345 1183 days ago
Even the best Google models seem to lag behind OpenAI's on reasoning tasks at the moment - see the graphs at https://github.com/suzgunmirac/BIG-Bench-Hard