Hacker News new | ask | show | jobs
by superkuh 592 days ago
Except in this case they quantized both the parameters and the activations leading to decreased compute time too.