Hacker News new | ask | show | jobs
by stavros 61 days ago
Better go for a less-quantized model even if it's slower than go for a faster, quantized one.
1 comments

Thank you!