RLHF from Scratch | HN Mirror

	RLHF from Scratch (github.com)
	75 points by onurkanbkrc 133 days ago

3 comments

fauria 132 days ago

RLHF: Reinforcement learning from human feedback - https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu...

alansaber 133 days ago

Looks good. I am a big advocate for these hands-on demos as the best way for beginners to learn ML.

vivzkestrel 132 days ago

I prefer things that explain stuff to me visually, like this post: https://mlu-explain.github.io/neural-networks/ Wouldn't it be nice if someone could actually cook up every type of neural network in that format?