| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by simonw 817 days ago
	"finetuning a base model costs ~$1 with LORA/QLORA and can yield same performance as gpt-4, but at 1/100 of the cost per token" That's a big claim - can you back that up with any examples?

2 comments

Implicated 817 days ago

I had opened a new tab back when this comment was just a few minutes old in hopes that when I came back there was some really great blog post linked with the details on the sorcery.

link

hallqv 816 days ago

https://arxiv.org/pdf/2402.00841.pdf

link