|
|
|
|
|
by nickstinemates
60 days ago
|
|
It feels so weird to me - people are exhausting their quotas while I am trying very hard to even reach mine with the $200 plan. We're generating all of the code for swamp[1] with AI. We review all of that generated code with AI (this is done with the anthropic API.) Every part of our SDLC is pure AI + compute. Many feature requests every day. Bug fixes, etc. Never hit the quota once. Something weird is definitely going on. 1: https://github.com/systeminit/swamp |
|
But people who go > 5 minutes between prompts and see no cache, usage is eaten up quickly. Especially passing in hundreds of thousands of tokens of conversation history.
I know my quote goes a lot further when I sit down and keep sessions active, and much less far when I’m distracted and let it sit for 10+ minutes between queries.
It’s a guess. But n=1 and possible confirmation bias noted, it’s what I’m seeing.