by vinn124
2939 days ago
> Many losses which don't seem differentiable can be reformulated as such...

Agreed, especially with policy gradients.

> If the dimensionality is small, second-order methods (or approximations thereof) can do dramatically better yet.

I have not seen second-order derivatives used in practice, presumably due to memory limitations. Can you point me to examples?
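To make the small-dimension case concrete, here is a minimal sketch comparing plain gradient descent with a Newton step. The test function, the step size, and all helper names (`grad`, `hess`, `solve_2x2`, `minimize`) are made up for illustration and come from no particular library.

```python
import math

# Hypothetical toy problem: f(x, y) = 0.5 * (x**2 + 100 * y**2),
# an ill-conditioned quadratic with its minimum at (0, 0).
def grad(p):
    x, y = p
    return (x, 100.0 * y)

def hess(p):
    # Full Hessian: n*n entries, which is the memory cost in question.
    return ((1.0, 0.0), (0.0, 100.0))

def solve_2x2(h, g):
    # Solve h @ d = g for a 2x2 matrix via Cramer's rule.
    (a, b), (c, d) = h
    det = a * d - b * c
    return ((d * g[0] - b * g[1]) / det, (a * g[1] - c * g[0]) / det)

def gd_step(p, g, lr=0.01):
    # First-order update; lr must stay below 2/100 here to be stable.
    return (p[0] - lr * g[0], p[1] - lr * g[1])

def newton_step(p, g):
    # Second-order update: rescale the gradient by the inverse Hessian.
    d = solve_2x2(hess(p), g)
    return (p[0] - d[0], p[1] - d[1])

def minimize(step, p, tol=1e-6, max_iter=10_000):
    # Iterate until the gradient norm drops below tol; return point and step count.
    for i in range(max_iter):
        g = grad(p)
        if math.hypot(*g) < tol:
            return p, i
        p = step(p, g)
    return p, max_iter

p_gd, n_gd = minimize(gd_step, (1.0, 1.0))
p_nt, n_nt = minimize(newton_step, (1.0, 1.0))
print("gradient descent steps:", n_gd)
print("newton steps:", n_nt)
```

On a quadratic like this the Newton step lands on the minimum in a single update, while gradient descent is throttled by the poorly conditioned direction. The catch is the Hessian itself: it has n² entries for n parameters, so storing and solving against it is only cheap when n is small.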