|
|
|
|
|
by jonas21
609 days ago
|
|
It's not like people had been ignoring Yann LeCun's work prior to AlexNet. It received quite a few citations and was famously used by the US Postal Service for reading handwritten digits. AlexNet happened in 2012 because the conditions necessary to scale it up to more interesting problems didn't exist until then. In particular, you needed: - A way to easily write general-purpose code for the GPU (CUDA, 2007). - GPUs with enough memory to hold the weights and gradients (~2010 - and even then, AlexNet was split across 2 GPUs). - A popular benchmark that could demonstrate the magnitude of the improvement (ImageNet, 2010). Additionally, LeCun's early work in neural networks was done
at Bell Labs in the late 80s and early 90s. It was patented by Bell Labs, and those patents expired in the late 2000s and early 2010s. I wonder if that had something to do with CNNs taking off commercially in the 2010s. |
|