by menzoic 1188 days ago
Stanford spent only $500 to fine-tune LLaMA to follow human instructions, using 52k instructions generated by GPT-3. This probably costs less. The massive cost reduction comes from using GPT instead of humans to generate the instruction data. The actual fine-tuning on GPUs is relatively cheap.

Most of that was getting the data; the training itself would cost something like $100, if memory serves.