Hacker News new | ask | show | jobs
by stephencanon 1085 days ago
You still get a perf benefit from half the memory traffic and keeping twice as much data in caches, since you can do the expansion to f32 when loading into registers.