by whereismyacc 504 days ago
Neither of the DeepSeek models is on Groq yet, but when/if they are, that combination makes so much sense: a high-quality open reasoning model, where you compensate for the slow inference of reasoning models with fast ASICs.

We are not going to see it happen without something like next-generation Groq chips. IIUC Groq can't run actually large LMs; the largest they offer is 70B LLaMA. DeepSeek-R1 is 671B.
Aha, for some reason I thought they provided full-size Llama through some bundling of multiple chips. Fair enough then. Anyway, long term I feel like providers running powerful open models on purpose-built inference ASICs will be really awesome.