Hacker News new | ask | show | jobs
by tartrate 798 days ago
Well, since it affects inference speed it means you can handle more in less time, needing less concurrency.