Hacker News new | ask | show | jobs
by iknownthing 749 days ago
I assumed it was coming straight from the API because of the token-by-token generation effect but maybe you're right.
1 comments

The token-by-token responses are probably API, while the "instant" loads seem to be cached.

Probably using groq based on speed of response