Hacker News new | ask | show | jobs
by cold_harbor 24 days ago
the real lesson: GPUs win on memory bandwidth not just FLOPs. batching ops keeps VRAM fed at 2TB/s instead of tripping to RAM at 50GB/s for every operation