Hacker News new | ask | show | jobs
by nextts 446 days ago
Yes. There are 2 aspects to this.

Roughly (from lay understand) LLMs predict what their training data would say. They are first trained on "the internet, etc." so they can predict words well, e.g. finish off "Paris is the..." then using human feedback they are trained further to work in chat mode and be non-offensive, concise, be pleasant etc.