Hacker News
by youarenotyu 26 days ago
idk why, but Gemini does very well on benchmarks yet is clearly falling behind GPT and Opus on actual tasks