We are going to see it happen without something like next generation Groq chips. IIUC Groq can't run actually large LMs, the largest they offer is 70B LLaMA. DeepSeek-R1 is 671B.
Aha, for some reason I thought they provided full-size Llama through some bundling of multiple chips. Fair enough then, anyway long term I feel like providers running powerful open models on purpose built inference ASICs will be really awesome.