by ViralBShah
2186 days ago
That is a pretty fun architecture. I hope it opens the door to higher performance for more workloads than top500. At least with the top500 benchmark, bandwidth is not a problem, so long as you can run a large enough problem. It is a linear solve that spends nearly all its time in matmul (O(n^3) operations on O(n^2) data), so with a big enough problem you can saturate the cores.
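
To put rough numbers on the n^3 vs n^2 point, here is a minimal sketch of the arithmetic intensity (flops per byte) of a dense LU solve. It assumes the leading-order flop count of (2/3) n^3, 8-byte doubles, and the idealized case where each matrix element is moved from memory only once; real blocked implementations only approach that. The function name is made up for illustration.

```python
def lu_arithmetic_intensity(n, bytes_per_element=8):
    """Idealized flops per byte for a dense n x n LU factorization.

    Assumes (2/3) n^3 flops (leading-order term only) and that each
    of the n^2 matrix elements is moved from memory exactly once.
    """
    flops = (2.0 / 3.0) * n**3
    data_bytes = bytes_per_element * n**2
    return flops / data_bytes


# Intensity grows linearly with n (it is n / 12 for doubles),
# so a larger problem needs less bandwidth per flop.
for n in (1_000, 10_000, 100_000):
    print(n, lu_arithmetic_intensity(n))
```

Under these assumptions the ratio grows linearly with n, which is why a large enough problem can keep the cores busy even when memory bandwidth is modest.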