Hacker News new | ask | show | jobs
by gdjdkslslp 499 days ago
I haven’t watched this video yet, but do you plan to create any technical videos in the (1) series on RL in LLMs?
2 comments

His intro to RL (not for LLM) blog post is a great read FYI

https://karpathy.github.io/2016/05/31/rl/

This would be very welcome as it brings us closer to understanding the secret sauce behind training a real, practical LLM.