|
|
|
|
|
by Const-me
1748 days ago
|
|
> they have 1.25MB of SRAM and 1TFlop of FP16/CFP8… This is woefully unequipped for the level of performance they want to achieve. Any idea how OP made that conclusion? My GeForce 1080Ti has 1.3MB of in-core L1 caches (28 streaming multiprocessors, 48kb L1 each). It also has L2 but not too large, slightly under 3MB for the whole chip. The GPU delivers about 10 TFlops of FP32 which needs 2x the RAM bandwidth of FP16. I’m generally OK with the level of performance, at least until the GPU shortage is fixed. |
|