| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by whereismyacc 551 days ago
	Neither of the deepseek models are on Groq yet, but when/if they are, that combination makes so much sense. A high quality open reasoning model, but you compensate for the slow inference of reasoning models with fast ASICs.

1 comments

rfoo 551 days ago

We are going to see it happen without something like next generation Groq chips. IIUC Groq can't run actually large LMs, the largest they offer is 70B LLaMA. DeepSeek-R1 is 671B.

link

whereismyacc 551 days ago

Aha, for some reason I thought they provided full-size Llama through some bundling of multiple chips. Fair enough then, anyway long term I feel like providers running powerful open models on purpose built inference ASICs will be really awesome.

link