|
|
|
|
|
by qeternity
921 days ago
|
|
Yeah I call BS on this. This does nothing to address the main issues with autoregressive transformer models (memory bandwidth). GPU compute units are mostly sitting idle these days waiting for chip cache to receive data fr VRAM. This does nothing to solve that. |
|