|
|
|
|
|
by frozenport
742 days ago
|
|
Well I've been using the groq public api, and its approx. the rates claimed. Economics and costs are hard to predict.
For example, Groq is not using HBM chips. So probably the cards are a lot easier to source. Its not clear what the capacity of these systems are in terms of total users, or even tokens per second. Then you factor in cost. Then you realize all vendors will match a competitors pricing. Then you realize Groq doesn't sell chips. ¯\_(ツ)_/¯ The only thing you have is the public API to benchmark against: https://artificialanalysis.ai/ |
|
- SambaNova has real revenue from big customers - SambaNova can run any model on a single node at the speed Groq requires - SambaNova can do low latency inference just like Groq, but can also run large batches and host hundreds of models on a single deployment - SambaNova does not quantize models unless explicitly stated - SambaNova can run training at perf competitive with Nvidia, as well as fastest inference in the world at full precision
It really isn't a competition. Groq has done great as garnering hype in recent months, but it is a house of cards.