by Radim
3261 days ago
Yep, that's the one. By the way, this "local-changes-only" approach crops up in many other places too, both in CS and in nature, because it's just so damn energy-efficient. For example, check out this (recent, July 2017) paper: Gomez, Ren, Urtasun & Grosse: "The Reversible Residual Network: Backpropagation Without Storing Activations", https://arxiv.org/abs/1707.04585. You can't help but think of the ancient programmer's trick for swapping two variables without temporary storage:

    a = a + b;
    b = a - b;
    a = a - b;
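
To make the parallel concrete, here is a minimal Python sketch of the additive coupling that reversible residual blocks are built on, next to the swap trick. The functions F and G are hypothetical stand-ins I made up for illustration (in the paper they are learned residual functions), and I use plain integer lists so the reconstruction is exact; with floating point it would only be approximate.

```python
def F(x):
    # Hypothetical residual function; any deterministic function works here.
    return [3 * v * v + 1 for v in x]


def G(x):
    # Second hypothetical residual function.
    return [v * v * v - 7 for v in x]


def add(a, b):
    return [p + q for p, q in zip(a, b)]


def sub(a, b):
    return [p - q for p, q in zip(a, b)]


def forward(x1, x2):
    # Additive coupling: each half is updated using only the other half.
    y1 = add(x1, F(x2))
    y2 = add(x2, G(y1))
    return y1, y2


def inverse(y1, y2):
    # Undo the two updates in reverse order; the inputs never had to be stored.
    x2 = sub(y2, G(y1))
    x1 = sub(y1, F(x2))
    return x1, x2


def swap(a, b):
    # The add/subtract swap from the comment, with no temporary variable.
    a = a + b
    b = a - b
    a = a - b
    return a, b
```

Calling inverse(*forward(x1, x2)) gives back x1 and x2, for the same reason the swap works: every step adds something that can be recomputed and subtracted again later. One caveat on the swap itself: Python integers don't overflow, but in C the a + b step can overflow for signed integers, which is undefined behavior.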