by bob1029
2 days ago
I started running some numbers on the scale of tokens per second we can generate with figures like 300 million watts, and I really don't understand the destination anymore. I see in the news that Anthropic is somehow constrained, but that doesn't line up with the headlines. Everything seems off by 3-4 orders of magnitude here. I realize there are some users of AI who can burn a million tokens like it's nothing, but these facilities can produce trillions (10^12) per day. I feel like there is some use case planned here that we won't know about until it's way too late to do anything about it. Or this is a very serious bubble. One of the two, or some really horrible blend.

I do feel like there are some use cases that are cost-constrained right now, but that area is getting smaller as local models get better.
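
For anyone who wants to redo the back-of-envelope math, here is a rough sketch in Python. The 300 MW figure and the "million tokens" heavy user come from the comment above; the energy cost per token is purely a placeholder assumption (not a measured value), so the sweep only shows how the daily total scales with whatever number you believe. The function names are hypothetical.

```python
SECONDS_PER_DAY = 86_400


def tokens_per_day(power_watts: float, joules_per_token: float) -> float:
    """Tokens a facility could emit per day if all power went to inference.

    joules_per_token is an ASSUMED all-in energy cost per generated token;
    it is a placeholder, not a measurement.
    """
    if joules_per_token <= 0:
        raise ValueError("joules_per_token must be positive")
    # watts = joules per second, so watts / (joules per token) = tokens per second
    return power_watts / joules_per_token * SECONDS_PER_DAY


def heavy_users_served(tokens: float, tokens_per_user_per_day: float = 1e6) -> float:
    """How many users burning a fixed token budget per day that output covers."""
    return tokens / tokens_per_user_per_day


if __name__ == "__main__":
    power = 300e6  # 300 million watts, the figure from the comment
    # Sweep a few placeholder energy costs, since the real value is unknown here.
    for jpt in (0.1, 1.0, 10.0):
        total = tokens_per_day(power, jpt)
        users = heavy_users_served(total)
        print(f"{jpt:>5} J/token -> {total:.3e} tokens/day, "
              f"{users:.3e} users at 1M tokens/day")
```

Under these placeholder values the total stays in the trillions-per-day range, which is the scale the comment is pointing at; a different energy-per-token assumption shifts it proportionally.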