|
|
|
|
|
by marmaduke
2780 days ago
|
|
> PyTorch, you still do have to define the backward function and worry about bookkeeping the gradient, clearing gradient values at the appropriate time, and explicitly calling to calculate these things in verbose optimizer invocation code I’ve definitely never had to do that. Where do you get this from? |
|