Y
Hacker News
new
|
ask
|
show
|
jobs
by
tome
848 days ago
I don't know about GPT 3.5 specifically, but on this independent benchmark (LLMPerf) Groq's time to first token is also lowest:
https://github.com/ray-project/llmperf-leaderboard?tab=readm...