Hacker News new | ask | show | jobs
by pclmulqdq 380 days ago
Oh yeah, I did that math not assuming any quantization. I think if you can get a 3-4 bit quant working + int8 math, ~80 might be achievable.