Hacker News new | ask | show | jobs
by redox99 1015 days ago
Unrelated. What matters for that is prompt processing time (which is in the high hundreds of tokens per second).