> 2bit is pretty damn terrible
Wait till you go hybrid [0] or even 1bit [1]
[0] https://github.com/efeslab/Atom
[1] https://github.com/IST-DASLab/qmoe
> 2bit is pretty damn terrible
Wait till you go hybrid [0] or even 1bit [1]
[0] https://github.com/efeslab/Atom
[1] https://github.com/IST-DASLab/qmoe