Hacker News new | ask | show | jobs
by kbrkbr 661 days ago
But isn't it the beauty of llm's that they need comparably little preparation (unstructured text as input) and pick the features on their own so to say?

edit: grammar

1 comments

Yes, if you want an LLM that doesn't listen to instructions and just endlessly babbles about anything and everything.

What turned GPT into chatGPT was a lot of structured training with human feedback.

Exactly. Section 4.3.7 briefly explains how they trained the model to better follow instructions ('steerability').