Y
Hacker News
new
|
ask
|
show
|
jobs
by
tcdent
607 days ago
These undergo additional fine tuning (QLoRA) using some or all of the original dataset, so they're able to get the weights to align to the nf4 dtype better, which increases the accuracy.