Y
Hacker News
new
|
ask
|
show
|
jobs
by
lazerlapin
100 days ago
With 1.58-bit ternary quantization, you may think you're running a big model but really you're just running a "mini" version of it