|
|
|
|
|
by ryao
384 days ago
|
|
Last year, I took the time to read through public documents and estimated that their annual production was limited to ~300 wafers per year from TSMC. That is not Nvidia level scale, but it is scale. There are many companies that sell tokens from an API and many more that need hardware to compute tokens. Cerebras posted a comparison of hardware options for these companies, so evaluating it as such is meaningful. It is perhaps less meaningful to the average person who cannot afford the barrier to entry to afford this hardware, but there are plenty of people curious what the options are for the companies that sell tokens through APIs, as those impact available capacity. |
|
I was just at Dell Tech World and they proudly displayed a slide during the CTO keynote that said:
"Cost per token decreased 4 orders of magnitude"
Personally speaking, not a business I'd want to get into.