Hacker News
by Terretta 18 days ago
> single-request decode speed is now the metric that matters

Benched at 96 input tokens, 4000 output tokens.
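
For a sense of how lopsided that request shape is, here is a quick sketch. The helper name is made up for illustration, and token share is only a rough proxy for where the work goes, not a timing measurement:

```python
def output_token_share(input_tokens: int, output_tokens: int) -> float:
    """Fraction of all tokens in one request that are generated (decode) tokens."""
    return output_tokens / (input_tokens + output_tokens)


# Request shape from the benchmark above: 96 tokens in, 4000 tokens out.
share = output_token_share(96, 4000)
print(f"{share:.1%} of the {96 + 4000} tokens are output tokens")
# prints "97.7% of the 4096 tokens are output tokens"
```

So at this shape nearly every token in the request is a generated one.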