Hacker News new | ask | show | jobs
by brookst 1128 days ago
What I meant was, fine tuning is not substantially different from training. It seems odd to use different words for the resulting systems.
1 comments

But fine-tuning is very different from (pre)training. Pretreating proceeds via unsupervised learning on massive amounts of data and compute, while fine-tuning uses much smaller amounts, with supervised learning (instruction tuning) and reinforcement learning (RLHF, constitutional AI).