Y
Hacker News
new
|
ask
|
show
|
jobs
by
FezzikTheGiant
547 days ago
what's the cost difference between groq/cerebras vs using something else for inferencing open source models? I'm guessing the speed comes at a cost?
2 comments
el_isma
546 days ago
0.6/1$ per M tokens in groq/cerebras vs 0.3$ per M tokens in deepinfra (for llama 3.3 70b)
But note the free tiers for groq and cerebras are
very
generous.
link
ilaksh
547 days ago
I don't know off the top of my head, only played with it a little not seriously.
link
FezzikTheGiant
547 days ago
fair enough
link
But note the free tiers for groq and cerebras are very generous.