Hacker News new | ask | show | jobs
by throwaway1851 1190 days ago
$100.

“For our initial run, fine-tuning a 7B LLaMA model took 3 hours on 8 80GB A100s, which costs less than $100 on most cloud compute providers. We note that training efficiency can be improved to further reduce the cost.”

($500 was what they paid OpenAI to generate the fine-tuning dataset.)