Hacker News new | ask | show | jobs
by woadwarrior01 932 days ago
Interesting! This also seems to work with smaller quantised models. I just tried it with a 4-bit quantised version of WizardLM 13B v1.2 and it seems to work quite well.