Hacker News
by zozbot234 84 days ago
The latter link says they get ~1.7 tok/s, which is quite impressive for a near-SOTA local model running on ordinary hardware.