by PartiallyTyped
174 days ago
Any reason for creating a new tensor when accumulating grads rather than updating the existing one? Edit: I asked this before I read the design decisions. The reasoning, as far as I understand, is that for simplicity there are no in-place operations, hence accumulation is done on a new tensor.
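
To make the trade-off concrete, here is a rough sketch in C of the two options. The `Tensor` struct and the names `tensor_new`, `tensor_free`, `grad_accumulate_new` and `grad_accumulate_inplace` are made up for illustration; they are assumptions and not taken from autograd.c.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical minimal tensor; NOT the struct used in autograd.c. */
typedef struct {
    float *data;
    size_t size;
} Tensor;

/* Allocate a zero-filled tensor, or return NULL on allocation failure. */
Tensor *tensor_new(size_t size) {
    Tensor *t = malloc(sizeof *t);
    if (!t) return NULL;
    t->data = calloc(size, sizeof *t->data);
    if (!t->data) {
        free(t);
        return NULL;
    }
    t->size = size;
    return t;
}

void tensor_free(Tensor *t) {
    if (t) {
        free(t->data);
        free(t);
    }
}

/* Out-of-place accumulation: returns a fresh tensor holding acc + g.
 * Both inputs are left untouched; the caller frees the old accumulator. */
Tensor *grad_accumulate_new(const Tensor *acc, const Tensor *g) {
    assert(acc->size == g->size);
    Tensor *out = tensor_new(acc->size);
    if (!out) return NULL;
    for (size_t i = 0; i < acc->size; i++) {
        out->data[i] = acc->data[i] + g->data[i];
    }
    return out;
}

/* In-place accumulation: mutates acc, no allocation.
 * Anything else holding a pointer to acc sees the change. */
void grad_accumulate_inplace(Tensor *acc, const Tensor *g) {
    assert(acc->size == g->size);
    for (size_t i = 0; i < acc->size; i++) {
        acc->data[i] += g->data[i];
    }
}
```

The out-of-place version costs one allocation per accumulation but never changes a tensor that something else might still point to. The in-place version avoids the allocation at the price of aliasing concerns, which is presumably the complexity the design decisions wanted to avoid.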
https://github.com/sueszli/autograd.c/blob/main/src/autograd...
I wonder whether there is a cleverer way to do this without sacrificing simplicity.