|
|
|
|
|
by fathead_glacier
2262 days ago
|
|
You can refer to a talk about Differentiable Programming here: https://www.youtube.com/watch?v=Sv3d0k7wWHk the talk is for Julia, albeit the principles are general. If you skip to https://youtu.be/Sv3d0k7wWHk?t=2877 you can see a question asked: why would you need this. In essence there are cases outside the well developed uses (CNN, LSTM etc.) such as Neural ODEs where you need to mix different tools (ODE solvers and neural networks) and the ability to do Differentiable Programming is helpful otherwise it is harder the get gradients. The way I can see it being useful is that it helps speed up development work so we can explore more architectures, again Neural ODEs being a great example. |
|
Erik Meijer gave two great talks on the concept.
https://m.youtube.com/watch?v=NKeHrApPWlo
https://m.youtube.com/watch?v=13eYMhuvmXE