Hacker News new | ask | show | jobs
by abhijat 158 days ago
Isn't the context window the same for all plans, 200k? You would hit usage limits?
1 comments

If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?