|
|
|
|
|
by jsheard
217 days ago
|
|
Cheap enough for now, but of all the companies selling inference at a loss, Cerebras and Groq are probably losing the most per token. Their hardware is ungodly expensive and its reliance on huge amounts of SRAM bottlenecks how much cheaper it can get, since SRAM density is improving at a snails pace at this point. |
|
But I'm just reasoning from first principles. I don't have any specific data about them.