Hacker News new | ask | show | jobs
by wmf 931 days ago
It sounds like they just recently got weight streaming working well which was my point: if you bought a CS-2 when it first came out you couldn't really use the off-chip memory and the on-chip memory wasn't enough to run LLMs.
2 comments

> It sounds like they just recently got weight streaming working well

Even if it was working poorly, the CS2 is still a lot of computer. The question is whether it was price-competitive with Nvidia at the time for the workloads it was acquired for.

Cerebras offers them in a batch-processing cloud-ish model, so their prices should reflect utility to some degree.

Interesting! But you think now that this is solved cerebras will be adopted more widely?
It's hard to say. We'll probably never have good information due to NDAs.