Hacker News new | ask | show | jobs
by TrainedMonkey 103 days ago
Will need to wait for real benchmarks, but based on OpenAI marketing Instant is their latency optimized offering. For voice interface, you don't actually need high tok/s because speech is slow, time to first token matters much more.