Hacker News new | ask | show | jobs
by lucidrains 807 days ago
I can't think of anyone better to teach attention mechanism to the masses. This is a dream come true
1 comments

Incredible. This 3B1B series was started 6 years ago and keeps going today with chapter 5.

If you haven't seen the first few chapters, I cannot recommend enough.

Would you be able to compare them to Andrew Ng's course?
IMO the style, formatting, and animations in 3B1B videos is what Coursera courses should have been about in the first place.

Andrew Ng's course doesn't use video effectively at all: half of each class is Andrew talking to the camera, while the other half is him slowly writing things down with a mouse. There's a reason why a lot of people recommend watching at 1.5x speed.

Online classes are online classes. If they try to make copy in-person classes, like most Coursera courses do, they will keep all of the weaknesses of online classes without any of its strengths.

I personally preferred Andrej Karpathy's CS231n taught by him and his private videos about neural nets in general and transformers in particular. He has a youtube vid where he builds one from scratch in Python!

3BlueOneBrown videos are a great complement to Karpathy's lectures to aid in visualising what is going on.

IMO I think the 3Blue1Brown video is a good place to start to build intuitions about how things work generally if you're new, and Andrew Ng's courses will help you dig into more detail, experiment, and implement things to build on those intuitions.
They're not really comparable - if you're wondering if you should do one or the other, you should do both.
The way you compare a technical drawing of a steam engine to The Fighting Temeraire oil painting.