Hacker News new | ask | show | jobs
by Art9681 18 days ago
Except the latency is significant and not suitable for clients with advanced agent features. The experience between using a frontier model via first party API and the best open weight models via OpenRouter is night and day. Can't get any real work done with it.
1 comments

Good point. When I use it, the inference doesn't seem very fast compared to the big providers, esp Time to First (non-reasoning)Token.