| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by FezzikTheGiant 594 days ago
	what's the cost difference between groq/cerebras vs using something else for inferencing open source models? I'm guessing the speed comes at a cost?

2 comments

0.6/1$ per M tokens in groq/cerebras vs 0.3$ per M tokens in deepinfra (for llama 3.3 70b)

But note the free tiers for groq and cerebras are very generous.

I don't know off the top of my head, only played with it a little not seriously.

fair enough