| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by sigmoid10 1097 days ago
	Also quasi-linear activation functions (prevent vanishing gradients), tons of regularisation (e.g convolutions) and more adaptive gradient descent (faster convergence). I've still met people in the early 2010s who tried to make neural networks work using only a few dozen units. Academia is pretty slow. What people also forget is that libraries like pytorch or tensorflow simply didn't exist. I wrote my own neural network stacks complete with backpropagation from scratch in c++ back then.

1 comments

bravura 1096 days ago

LeCun et al (1989) had backprop working for digit recognition.

LeCun, Bottou, et al (2002) in "Efficient Backprop" described techniques for improving backprop algorithms.

link

sigmoid10 1096 days ago

Rosenblatt had a working perceptron for classifying images in the 1950s (!). And yet it took 60 years before the theory and compute power had developed enough for all of this to be interesting outside of small, purely academic experiments.

link

bravura 1095 days ago

Handwriting recognition on checks (LeCun et al 1989) wasn't really a small, purely academic experiment

link

sigmoid10 1094 days ago

And yet classical OCR techniques continued to dominate. Nothing happened in the industry on that front for over 20 years. That's as academic as it gets.

link