It sounds like they just recently got weight streaming working well which was my point: if you bought a CS-2 when it first came out you couldn't really use the off-chip memory and the on-chip memory wasn't enough to run LLMs.
> It sounds like they just recently got weight streaming working well
Even if it was working poorly, the CS2 is still a lot of computer. The question is whether it was price-competitive with Nvidia at the time for the workloads it was acquired for.
Cerebras offers them in a batch-processing cloud-ish model, so their prices should reflect utility to some degree.
Even if it was working poorly, the CS2 is still a lot of computer. The question is whether it was price-competitive with Nvidia at the time for the workloads it was acquired for.
Cerebras offers them in a batch-processing cloud-ish model, so their prices should reflect utility to some degree.