Hacker News new | ask | show | jobs
by rfoo 504 days ago
We are going to see it happen without something like next generation Groq chips. IIUC Groq can't run actually large LMs, the largest they offer is 70B LLaMA. DeepSeek-R1 is 671B.
1 comments

Aha, for some reason I thought they provided full-size Llama through some bundling of multiple chips. Fair enough then, anyway long term I feel like providers running powerful open models on purpose built inference ASICs will be really awesome.