1. https://www.assemblyai.com/blog/the-full-story-of-large-lang... (if you're already familiar with LLMs, you can skip ahead to "What RLHF actually does to an LLM")
2. https://www.assemblyai.com/blog/how-chatgpt-actually-works/