Hacker News new | ask | show | jobs
by scotty79 3 days ago
> It climbed to 84 tok/s, then hit a wall, insisting further optimization was impossible.

> Hours later, Anthropic rolled back invisible LLM development safeguards, and it hit 255 tok/s.

Wow. Limitnig access to models for other reasons than that you can't physically provide it should be a crime against humanity or the planet or something. So much immediate efficency left on the table for stupid reasons.