by philovivero 1060 days ago
Running a model isn't binary; it's metered by the amount of time spent generating tokens.