Hacker News new | ask | show | jobs
by youarenotyu 26 days ago
idk why but gemini acts very well on benchmarks, but it's clearly falling behind gpt and opus in actual tasks