Hacker News new | ask | show | jobs
by tipsytoad 125 days ago
Throughput is a metric for the total number of tokens/sec for all users in the system. Latency (ITL,TTFT) are individual user metrics.