by aoeusnth1 32 days ago
We're discussing whether they are models, not whether they have goals and agency. A language model does form a model of who you are and what you're thinking: language is causally connected to those aspects of the generating distribution, so modeling them reduces cross-entropy.

RL provides the goals and agency. Pretraining provides the model.
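
To make the cross-entropy point concrete, here's a toy sketch. Everything in it is made up for illustration: two hypothetical "authors" A and B, a two-token vocabulary, and the probabilities 0.9 and 0.1. It compares a predictor that ignores who is writing against one that infers the author from the first token before predicting the second.

```python
import math
from itertools import product

# Hypothetical toy setup (not from any real model): two authors, equally
# likely, each with its own probability of emitting token 1.
P_AUTHOR = {"A": 0.5, "B": 0.5}
P_ONE = {"A": 0.9, "B": 0.1}


def p_token(x, author):
    """Probability that `author` emits token x (0 or 1)."""
    return P_ONE[author] if x == 1 else 1.0 - P_ONE[author]


def p_joint(x1, x2):
    """True probability of the pair (x1, x2), marginalizing over the author."""
    return sum(P_AUTHOR[a] * p_token(x1, a) * p_token(x2, a) for a in P_AUTHOR)


def predict_blind(x2, x1):
    """Ignores x1 entirely: predicts x2 from the overall token frequency."""
    return sum(P_AUTHOR[a] * p_token(x2, a) for a in P_AUTHOR)


def predict_aware(x2, x1):
    """Forms a posterior over the author from x1, then predicts x2."""
    evidence = sum(P_AUTHOR[a] * p_token(x1, a) for a in P_AUTHOR)
    return sum(
        (P_AUTHOR[a] * p_token(x1, a) / evidence) * p_token(x2, a)
        for a in P_AUTHOR
    )


def cross_entropy_bits(predict):
    """Expected -log2 q(x2 | x1) under the true joint distribution."""
    return -sum(
        p_joint(x1, x2) * math.log2(predict(x2, x1))
        for x1, x2 in product((0, 1), repeat=2)
    )


blind = cross_entropy_bits(predict_blind)
aware = cross_entropy_bits(predict_aware)
print(f"blind: {blind:.3f} bits, aware: {aware:.3f} bits")
```

In this toy case the author-aware predictor pays fewer bits per token than the blind one, which is the only sense of "modeling you" the argument needs. It says nothing about goals or agency.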