|
|
|
|
|
by bitL
3171 days ago
|
|
A question about an opposite problem - is there a way to do this (and the whole Deep Learning) on discrete domains? So far all I've seen assumes continuous functions to be able to perform back-propagation; haven't seen anyone using discrete calculus with similar rules to continuous one (see Graham/Knuth/Patashnik). That could open many more interesting applications... |
|
Trying to learn discrete rules is harder because the learning procedure uses gradients to adjust parameters, and the gradients will be zero in a lot more places with discrete "rules".
Gradient Boosted Trees are probably the main thing that comes to mind, but they're not really deep learning.
People have tried to learn hard vs soft attention mechanisms, and while hard attention is faster, it results in worse accuracy and is harder to train.
The inference I draw is that most of the things we want to learn are not described well by discrete rules.