Hacker News new | ask | show | jobs
by ericd 623 days ago
Interesting that they’ve scaled on-chip memory sublinearly with the growth of transistors between their generations, I would’ve thought they would try to bump that number up. Maybe it’s not a major bottleneck for their training runs?
1 comments

SRAM is scaling significantly more slowly than logic in recent process nodes.
Ahh that explains it, thanks. Seems like a potentially large problem given their strategy.
They could use something like GCRAM[1] to double capacity if they had to...but it's not clear how much worse performance would be.

[1]https://raaam-tech.com/products/

The performance doesn't look great (yet). See Fig. 7

https://www.eng.biu.ac.il/fishale/files/2020/12/A-1-Mbit-Ful...

Cerebras runs at 1.1 GHz[1], and this was a much earlier design on 16nm so it might be a good fit by now. Their TSMC 5 nm version is scheduled for early 2025.[2]

[1]https://cerebras.ai/blog/cerebras-architecture-deep-dive-fir...

[2]https://www.eenewseurope.com/en/raaam-signs-lead-licensee-fo...

They'd have to quadruple their performance to be relevant in the market generally, here's to hoping.