Hacker News
by
simonw
817 days ago
"finetuning a base model costs ~$1 with LORA/QLORA and can yield same performance as gpt-4, but at 1/100 of the cost per token"
That's a big claim - can you back that up with any examples?
2 comments
Implicated
817 days ago
I opened a new tab back when this comment was just a few minutes old, hoping that by the time I came back there would be a really great blog post linked with the details on the sorcery.
hallqv
816 days ago
https://arxiv.org/pdf/2402.00841.pdf