by throwawayiionqz 2198 days ago
If subgradients are enough (-1 is a correct subgradient at 0 in your example), then there are valid approaches to AD with subgradients; see https://arxiv.org/abs/1809.08530
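To make the subgradient claim concrete, here is a minimal sketch. The parent's example isn't quoted here, so f(x) = |x| is only an assumed stand-in, and `is_subgradient` is a hypothetical helper, not something from the linked paper. It checks the defining inequality f(y) >= f(x0) + g*(y - x0) on a grid of sample points, which can refute a candidate g but is only a spot check, not a proof.

```python
def is_subgradient(f, x0, g, points, tol=1e-12):
    """Spot-check the subgradient inequality f(y) >= f(x0) + g*(y - x0) on sample points."""
    return all(f(y) >= f(x0) + g * (y - x0) - tol for y in points)

# Assumed stand-in for the parent's example: f(x) = |x|, convex with a kink at 0.
f = abs

# Sample points in [-5, 5]; a finer or wider grid would work the same way.
points = [i / 10 for i in range(-50, 51)]

# Candidate slopes at x0 = 0. For |x| the subdifferential at 0 is [-1, 1],
# so -1, 0 and 1 should pass and -1.5 should fail.
valid = {g: is_subgradient(f, 0.0, g, points) for g in (-1.0, 0.0, 1.0, -1.5)}

for g, ok in valid.items():
    print(f"g = {g:+.1f}: {'passes' if ok else 'fails'} the inequality on the grid")
```

Under that assumption, any value in [-1, 1] is an acceptable answer for an AD system to return at 0, which is why -1 counts as correct rather than as a bug.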
1 comment

Thanks for the link.