Hacker News
by SleekEagle 1101 days ago
My colleague wrote a couple of pieces that talk about RLHF:

1. https://www.assemblyai.com/blog/the-full-story-of-large-lang... (you can scroll to "What RLHF actually does to an LLM" if you're already familiar with LLMs)

2. https://www.assemblyai.com/blog/how-chatgpt-actually-works/