Hacker News
by rodiger 745 days ago
The token-by-token responses are probably live API calls, while the "instant" loads seem to be cached.

Probably using Groq, based on the speed of the responses.