Hacker News new | ask | show | jobs
by ironrabbit 1253 days ago
Andrej's "Building Makemore" series is exactly this, it includes a wonderful lecture where he computes all the gradients for a simple network by hand and compares them against the values produced by torch's autograd.
1 comments

I find Makemore to be my favorite neural-net-based (w)rapper.