by AstralStorm
3008 days ago
The problem is with gradient propagation, which requires approximately associative arithmetic. Without it, you will be training your network to do something other than what you wanted. You evaluate the network in two orders: forward (use) and backward (training).
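To make the associativity point concrete, here is a minimal sketch using ordinary IEEE 754 doubles rather than any particular number format. It only shows that summing the same values in forward and in reversed order can give different results; the function name `accumulate` and the sample values are my own picks for illustration, not anything from a real framework.

```python
def accumulate(values):
    """Sum values strictly left to right, one addition at a time."""
    total = 0.0
    for v in values:
        total += v
    return total

# Illustrative values only (an assumption, not taken from any real network).
values = [0.1, 0.2, 0.3]

forward = accumulate(values)                    # (0.1 + 0.2) + 0.3
backward = accumulate(list(reversed(values)))   # (0.3 + 0.2) + 0.1

print(forward == backward)       # False: the grouping changed the result
print(abs(forward - backward))   # tiny here, but not zero
```

With doubles the mismatch is around one unit in the last place, so it is harmless in practice. The worry is an arithmetic where the gap between the two orderings is much larger, because then the backward pass no longer computes gradients of the function the forward pass evaluated.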