|
|
|
|
|
by smaddox
3353 days ago
|
|
There's good reason to believe that different parts of the brain are quasi-specialized to particular applications. In a sense, this applies to particular software artificial neural networks (ANN) as well, particularly if the various hyper-parameters are fixed (number of neural units per layer, etc). One of the primary advantages of software ANNs over hardware ANNs (which don't really exist yet) would be the ability to easily change the hyper parameters. Hardware implementations of ANNs, such as might be designed based on these FTJ-based artificial synapses, would have some fixed hyper parameters, and thus would be pseudo-specialized. This disadvantage could potentially be more than compensated for by a dramatic learning speedup and power-usage reduction. Transistors are highly scaled and low power, but it takes a lot of them and a lot of time to simulate each neural unit. On a separate note, the best-performing software ANNs don't emulate spike time dependant plasticity, which is believed to be the primary learning mechanism of the human brain. Instead, they use variations of backpropagation and gradient descent, which is almost certainly not how the human brain learns. It remains to be fully understood how the two compare at various tasks. Most likely, they will have different strengths and weaknesses, making each useful in their own right. |
|
The possibility space between relatively simple and insufficiently general unsupervise/clustering approaches and rigid SGD schemes is large, and probably contains the brain's true inference engine. Personally, I am excited by some of the ideas brought forward in this Bengio paper: https://arxiv.org/pdf/1602.05179.pdf