Hacker News new | ask | show | jobs
by vletal 552 days ago
That would be 1.38 bits per weight on average, which I can confidently guess would not perform well.
2 comments

BitNet is functional at 1.58 bpw.
The model card says the 70B is 16 bit so I think you have twice that