Hacker News new | ask | show | jobs
by _aavaa_ 15 days ago
On the front page right now is the newest announcement from Xiaomi serving large model at over 1,000 tok/s on standard server gpus.

Every facet of the field is being pushed on and advanced at the same time.