Hacker News new | ask | show | jobs
by H8crilA 1101 days ago
This says nothing on how RLHF works, but a lot on what can be the results.
2 comments

You can check here for an explanation (with some helpful figures) https://www.assemblyai.com/blog/the-full-story-of-large-lang...
Yes! I came to make the same comment.

It's got a catchy title but it leaves much to be resolved.