Y
Hacker News
new
|
ask
|
show
|
jobs
by
ZeroCool2u
110 days ago
Frontier Math, GPQA Diamond, and Browsecomp are the benchmarks I noticed this on.
1 comments
csnweb
110 days ago
Are you may be comparing the pro model to the non pro model with thinking? Granted it’s a bit confusing but the pro model is 10 times more expensive and probably much larger as well.
link
ZeroCool2u
110 days ago
Ah yes, okay that makes more sense!
link