|
|
|
|
|
by Componica
602 days ago
|
|
The Yann LeCun paper 'Gradient-Based Learning Applied to Document Recognition' specified the modern implementation of a convolutional neural network and was published in 1998. AlexNet, which woke up the world to CNNs, was published in 2012. Between that time in the early 2000s I was selling implementations of really good object classifiers and OCRs. |
|
AlexNet happened in 2012 because the conditions necessary to scale it up to more interesting problems didn't exist until then. In particular, you needed:
- A way to easily write general-purpose code for the GPU (CUDA, 2007).
- GPUs with enough memory to hold the weights and gradients (~2010 - and even then, AlexNet was split across 2 GPUs).
- A popular benchmark that could demonstrate the magnitude of the improvement (ImageNet, 2010).
Additionally, LeCun's early work in neural networks was done at Bell Labs in the late 80s and early 90s. It was patented by Bell Labs, and those patents expired in the late 2000s and early 2010s. I wonder if that had something to do with CNNs taking off commercially in the 2010s.