Y
Hacker News
new
|
ask
|
show
|
jobs
by
mikeravkine
1059 days ago
Outliers only begin to appear around 3B parameters (as per the original LLM.int8 paper) so unfortunately not consumer GPU in an afternoon kinda stuff to prove you've managed to suppress them.