by JJneid 742 days ago
Performance usually takes a hit with quantization. Are you getting quality responses?

Since llama3, yes, quite satisfying.