Hacker News new | ask | show | jobs
by ACCount37 235 days ago
"Just" is the wrong way to put it.

Pretraining teaches LLMs everything. SFT and RL is about putting that "everything" into useful configurations and gluing it together so that it works better.