Hacker News
by
hyperknot
371 days ago
I got 700+ tokens/sec on o3 after the announcement, so I suspect it's very much a quantized version.
https://x.com/hyperknot/status/1932476190608036243
3 comments
dist-epoch
371 days ago
Or maybe they just brought much faster, much cheaper hardware online.
az226
370 days ago
Or they are using a speedy add-on decoder.
beering
370 days ago
Do you also have numbers on its intelligence before and after?
zackangelo
371 days ago
Is that input tokens/s or output tokens/s?