Hacker News new | ask | show | jobs
by tcdent 607 days ago
These undergo additional fine tuning (QLoRA) using some or all of the original dataset, so they're able to get the weights to align to the nf4 dtype better, which increases the accuracy.