Hacker News new | ask | show | jobs
by FezzikTheGiant 547 days ago
what's the cost difference between groq/cerebras vs using something else for inferencing open source models? I'm guessing the speed comes at a cost?
2 comments

0.6/1$ per M tokens in groq/cerebras vs 0.3$ per M tokens in deepinfra (for llama 3.3 70b)

But note the free tiers for groq and cerebras are very generous.

I don't know off the top of my head, only played with it a little not seriously.
fair enough