by biotechbio 1240 days ago
Just finished assignment 2 of CS224n[1], which has you derive gradients and implement word2vec. I thought it was a pretty good exercise. You could read the GloVe paper and try implementing that as well.

Knowing how to step through backpropagation in a neural network gets you pretty far in understanding a lot of architectures conceptually. IMO there’s no substitute for writing out the gradients by hand to make sure you get what’s going on, even if only in a toy example.
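
For a sense of what such a toy example might look like, here is a rough sketch (not the assignment's actual code). It assumes the naive-softmax skip-gram loss J = -log softmax(U v_c)[o], whose hand-derived gradient with respect to the center vector is U^T (y_hat - y), and checks that against a central finite difference. The names `v_c`, `U`, `o` and all the numbers are made up for illustration.

```python
import math

def softmax(scores):
    # Subtract the max for numerical stability before exponentiating.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def scores_for(v_c, U):
    # One dot product per "outside" word vector (rows of U).
    return [sum(u_k * v_k for u_k, v_k in zip(u, v_c)) for u in U]

def loss(v_c, U, o):
    # Naive-softmax loss: negative log-probability of the true outside word o.
    return -math.log(softmax(scores_for(v_c, U))[o])

def grad_center(v_c, U, o):
    # Hand-derived gradient: dJ/dv_c = U^T (y_hat - y), with y one-hot at o.
    y_hat = softmax(scores_for(v_c, U))
    delta = [p - (1.0 if w == o else 0.0) for w, p in enumerate(y_hat)]
    return [sum(delta[w] * U[w][k] for w in range(len(U)))
            for k in range(len(v_c))]

def numeric_grad_center(v_c, U, o, eps=1e-6):
    # Central finite difference, one coordinate of v_c at a time.
    grad = []
    for k in range(len(v_c)):
        plus = list(v_c)
        minus = list(v_c)
        plus[k] += eps
        minus[k] -= eps
        grad.append((loss(plus, U, o) - loss(minus, U, o)) / (2 * eps))
    return grad

# Made-up toy data: 4-word vocabulary, 3-dimensional embeddings.
v_c = [0.1, -0.2, 0.3]
U = [[0.2, 0.1, -0.1],
     [-0.3, 0.4, 0.2],
     [0.5, -0.5, 0.1],
     [0.0, 0.3, -0.4]]
o = 2

analytic = grad_center(v_c, U, o)
numeric = numeric_grad_center(v_c, U, o)
max_diff = max(abs(a - n) for a, n in zip(analytic, numeric))
print("max abs difference:", max_diff)
```

If the derivation is right, the two gradients should agree to several decimal places; a large gap usually means a sign or indexing mistake in the hand-written gradient.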

[1] https://web.stanford.edu/class/cs224n/

1 comment

thanks!