by abtinf 84 days ago
The lack of a token rate metric for the Kimi example is disappointing.
1 comment

The latter link says they get ~1.7 tok/s, which is quite impressive for a near-SOTA local model running on ordinary hardware.