Hacker News new | ask | show | jobs
by WatchDog 58 days ago
All of their benchmarks are against 16 bit models right?

Why aren't they comparing to 2/3/4 bit quants?

1 comments

looked at quant versions of these models and they all outperform it so I guess it just doesn't look as good.