|
|
|
|
|
by IshKebab
2361 days ago
|
|
I'm afraid recursivecaveat is right. This is an insanely difficult compilation target. I think you're possibly talking about a different kind of "compilation" - i.e. the Clang/GCC bit that converts C++ to machine code. That is indeed trivial. But "compilation" for these chips includes much more than that. The really complicated bit is converting the tensorflow model to some kind of computation plan. Where do you put all the tensor data? How do you move it around the chip. It's insanely complicated. If anything kills Cerebras it will be the software. |
|
https://secureservercdn.net/198.12.145.239/a7b.fcb.myftpuplo...
Then you put your data next to the core that uses it. Simples.
(Optimal placement is tricky, but approximate techniques work fine.)