Hacker News new | ask | show | jobs
by KptMarchewa 139 days ago
There is a difference between quantization of SOTA model and old models. People want non-quantized SOTA models, rather than old models.
1 comments

Put that all aside. Why can’t they demo a model on max load to show what it’s capable of…?

Yeah, exactly.