Hacker News
by sigotirandolas, 1030 days ago
I assume he means quantization (e.g. reducing the weights from 16-bit to 4-bit precision), which speeds up the output by reducing the amount of work done.
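To make the 16-bit to 4-bit idea concrete, here is a minimal sketch of one simple scheme (symmetric, one scale for the whole tensor). The function names and sample weights are made up for illustration, and this is not necessarily the scheme the parent had in mind; real quantizers are usually more elaborate, for instance using a separate scale per block of weights.

```python
def quantize_4bit(weights):
    """Map floats to signed 4-bit integers in [-8, 7] using one shared scale.

    Hypothetical helper for illustration only.
    """
    max_abs = max(abs(w) for w in weights)
    # The largest magnitude maps to 7; fall back to 1.0 for an all-zero input.
    scale = max_abs / 7 if max_abs > 0 else 1.0
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the 4-bit integers."""
    return [v * scale for v in q]


# Made-up example weights.
weights = [0.12, -0.53, 0.98, -0.07, 0.31]
q, scale = quantize_4bit(weights)
approx = dequantize(q, scale)

# Each weight now needs 4 bits instead of 16, plus one shared scale.
bits_before = 16 * len(weights)
bits_after = 4 * len(weights)
```

Each stored value is a quarter of the original size, at the cost of a rounding error of at most about half of `scale` per weight.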