Hacker News
by bob1029 2 days ago
I started running some numbers on how many tokens per second we can generate with figures like 300 million watts, and I really don't understand the destination anymore. I see in the news that Anthropic is somehow constrained, but that doesn't line up with the headlines. Everything seems off by 3-4 orders of magnitude here. I realize there are some users of AI who can burn a million tokens like it's nothing, but these facilities can produce trillions (10^12) per day.
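
For anyone who wants to redo this estimate, here is a rough sketch of the arithmetic. The 300 MW figure comes from the post; the energy cost per token is NOT a known number here, so `joules_per_token` is a purely hypothetical input that gets swept across a few orders of magnitude. `inference_fraction` is likewise an assumed knob for how much of the power actually goes to serving tokens rather than training, cooling, or idle capacity.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def tokens_per_day(power_watts, joules_per_token, inference_fraction=1.0):
    """Back-of-envelope daily token output for a facility.

    power_watts        -- facility power draw (300e6 for the 300 MW in the post)
    joules_per_token   -- ASSUMED all-in energy per generated token (hypothetical)
    inference_fraction -- ASSUMED share of energy spent on inference, 0..1
    """
    if joules_per_token <= 0:
        raise ValueError("joules_per_token must be positive")
    if not 0.0 <= inference_fraction <= 1.0:
        raise ValueError("inference_fraction must be between 0 and 1")
    joules_per_day = power_watts * SECONDS_PER_DAY * inference_fraction
    return joules_per_day / joules_per_token


if __name__ == "__main__":
    # Sweep the unknown: these per-token energies are placeholders, not measurements.
    for jpt in (1.0, 10.0, 100.0):
        print(f"{jpt:>6.1f} J/token -> {tokens_per_day(300e6, jpt):.3e} tokens/day")
```

300 MW works out to about 2.6 × 10^13 joules per day, so the "trillions per day" figure holds whenever the assumed cost is somewhere around 1 to 10 joules per token. The conclusion moves one-for-one with that assumption, which is why it is left as a parameter.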

I feel like there is some use case planned here that we aren't meant to know about until it's way too late to do anything about it. Or this is a very serious bubble. One of the two, or some really horrible blend.

3 comments

I'd be really interested in seeing your numbers. I've thought about this.

I do feel like there are some use cases which are cost-constrained right now, but that area is getting smaller as local models get better.

I haven't done any math, but if you have individual users that can burn millions of tokens a day, then it does not take very many of them (at SaaS scale even limiting to power users) to hit trillions. And even fewer to run into problems specifically with time of use.
Also consider that it's a pyramid scheme: AI users using AI to generate AI apps filled with AI features, powered by OpenAI and Anthropic APIs, and monitored and fixed by AI tools and automation.
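
To put a number on the "not very many users" point, a minimal sketch follows. The per-user consumption figures are illustrative assumptions, not measured usage, and `users_needed` is just a hypothetical helper name.

```python
def users_needed(target_tokens_per_day, tokens_per_user_per_day):
    """Smallest whole number of users whose combined daily usage reaches the target."""
    if tokens_per_user_per_day <= 0:
        raise ValueError("tokens_per_user_per_day must be positive")
    # Integer ceiling division, so a partial user rounds up to a whole one.
    return -(-target_tokens_per_day // tokens_per_user_per_day)


if __name__ == "__main__":
    target = 10**12  # one trillion tokens per day, the scale mentioned in the post
    # ASSUMED per-user burn rates (tokens/day), chosen only for illustration.
    for per_user in (10**6, 5 * 10**6, 10**7):
        print(f"{per_user:>12,} tokens/user/day -> {users_needed(target, per_user):,} users")
```

At one million tokens per user per day, a trillion tokens takes a million such users (10^12 / 10^6). This counts daily totals only and says nothing about time-of-use peaks, which would tighten the constraint further.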

Meanwhile, product managers and sales are busy drumming up new features with AI.

Did you factor in model training and peak vs. average throughput?

Also yes, it is a serious bubble.