by santaclaus 3113 days ago
> Newton's method

Where are you seeing Newton's method? I didn't think second-order information was available for typical systems in statistical machine learning.


My understanding is that the issue is cost: the full Hessian of the loss is too expensive to compute at each step relative to the gain in learning speed it buys.
Yeah, I think that's why quasi-Newton methods like BFGS were developed.
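
To make the trade-off concrete, here is a rough sketch on a toy 2-D quadratic. The matrix A, the vector b, and every function name below are made up for illustration and do not come from any library. The Newton step needs the explicit Hessian and a linear solve. The BFGS loop only ever calls the gradient and builds up an inverse-Hessian estimate from gradient differences. The line search is plain Armijo backtracking, which is a simplification; real implementations usually enforce the Wolfe conditions.

```python
# Toy loss (assumed for illustration): f(x) = 0.5 * x^T A x - b^T x
A = [[3.0, 1.0], [1.0, 2.0]]   # this is also the Hessian of f
b = [1.0, 1.0]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def matvec(M, v):
    return [dot(row, v) for row in M]

def f(x):
    return 0.5 * dot(x, matvec(A, x)) - dot(b, x)

def grad(x):
    Ax = matvec(A, x)
    return [Ax[i] - b[i] for i in range(len(x))]

def newton_step(x):
    """One Newton step: needs the full Hessian and a linear solve (2x2 here)."""
    g = grad(x)
    (h00, h01), (h10, h11) = A
    det = h00 * h11 - h01 * h10
    # step = H^{-1} g via the closed-form 2x2 inverse
    step = [(h11 * g[0] - h01 * g[1]) / det,
            (-h10 * g[0] + h00 * g[1]) / det]
    return [x[0] - step[0], x[1] - step[1]]

def bfgs(x, iters=100, tol=1e-8):
    """BFGS: uses only gradients; Hinv approximates the inverse Hessian."""
    n = len(x)
    Hinv = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    g = grad(x)
    for _ in range(iters):
        if dot(g, g) ** 0.5 < tol:
            break
        p = [-v for v in matvec(Hinv, g)]          # search direction
        alpha, fx, slope = 1.0, f(x), dot(g, p)
        # Armijo backtracking: halve alpha until we get sufficient decrease
        while (f([x[i] + alpha * p[i] for i in range(n)])
               > fx + 1e-4 * alpha * slope and alpha > 1e-12):
            alpha *= 0.5
        x_new = [x[i] + alpha * p[i] for i in range(n)]
        g_new = grad(x_new)
        s = [x_new[i] - x[i] for i in range(n)]    # change in parameters
        y = [g_new[i] - g[i] for i in range(n)]    # change in gradient
        ys = dot(y, s)
        if ys > 1e-12:                             # skip update if curvature is degenerate
            rho = 1.0 / ys
            Hy = matvec(Hinv, y)
            c = rho * rho * dot(y, Hy) + rho
            # Expanded form of (I - rho s y^T) Hinv (I - rho y s^T) + rho s s^T
            Hinv = [[Hinv[i][j] - rho * (s[i] * Hy[j] + Hy[i] * s[j])
                     + c * s[i] * s[j] for j in range(n)] for i in range(n)]
        x, g = x_new, g_new
    return x

print(newton_step([0.0, 0.0]))
print(bfgs([0.0, 0.0]))
```

Both should land near (0.2, 0.4), the solution of A x = b. The point is the cost. The Hessian has n^2 entries and the solve is roughly O(n^3), which is hopeless at typical ML parameter counts, while BFGS gets by on O(n)-sized gradients. It still stores an n x n matrix, though, which is why limited-memory variants exist for large n.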