Hacker News new | ask | show | jobs
by pure-awesome 3062 days ago
The basic mechanisms for building a neural network from scratch are almost disappointingly simple (provided you know a little bit of calculus and linear algebra). And setting up a basic network in an existing architecture is pretty trivial.

I'm currently busy with the neural networks and deep learning specialization on Coursera.

https://www.coursera.org/specializations/deep-learning

The trick, as far as I can tell, lies in with the various techniques for setting up your data, tuning your hyperparameters, and picking the right architecture for the job. At least, this seems to be the message of the course. It seems to still be a bit of an ad-hoc field. There are a number of techniques and things to try, without there necessarily being more than a shallow theoretical understanding from the experts as to why they actually work.

Then, of course, there are the experts and researchers who come up with entirely new architectures. Now that actually takes skill.

1 comments

Isn't this literally the point of computers? A processor is something that does amazingly stupid stuff 1.3 million times a second.

If you do even very simple things rapidly enough, you can get amazing results.