Hacker News new | ask | show | jobs
RLHF from Scratch (github.com)
75 points by onurkanbkrc 133 days ago
3 comments

RLHF: Reinforcement learning from human feedback - https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu...
Looks good. I am a big advocate for these hands on demos as being the best way for beginners to learn ML
i prefer things that can explain stuff to me visually like this post here https://mlu-explain.github.io/neural-networks/ wouldnt it be nice if someone could actually cook every type of neural network in that format?