by hyperbovine 3495 days ago
You can just use subgradient descent. A nonconvex loss would pose a bigger problem.
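
For anyone who hasn't seen it, here is a rough sketch of what subgradient descent looks like on a convex loss with a kink. The loss |x - target|, the 1/k step size, and the function names are all assumptions picked for illustration, not anything from the thread. The iterates are not guaranteed to decrease the loss at every step, so the usual trick is to keep track of the best point seen so far.

```python
def subgradient_abs(x, target):
    """Return one valid subgradient of f(x) = |x - target|.

    Away from the kink this is the ordinary derivative (+1 or -1).
    At the kink any value in [-1, 1] is valid; 0 is chosen here.
    """
    if x > target:
        return 1.0
    if x < target:
        return -1.0
    return 0.0


def subgradient_descent(x0, target, steps=1000):
    """Minimize |x - target| with a diminishing step size of 1/k.

    Hypothetical helper for illustration. Returns the best iterate
    seen and its loss, since individual steps may increase the loss.
    """
    x = x0
    best_x, best_f = x, abs(x - target)
    for k in range(1, steps + 1):
        g = subgradient_abs(x, target)
        x = x - (1.0 / k) * g  # diminishing, non-summable step size
        f = abs(x - target)
        if f < best_f:
            best_x, best_f = x, f
    return best_x, best_f


if __name__ == "__main__":
    best_x, best_f = subgradient_descent(x0=0.0, target=3.0)
    print(best_x, best_f)
```

A fixed step size would leave the iterates bouncing around the kink at a constant distance, which is why the sketch shrinks the step over time.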