|
|
|
|
|
by blackbear_
255 days ago
|
|
Not sure I am following; typical neural network training via stochastic gradient descent does not require Jacobian inversion. Less popular techniques like normalizing flows do need that but instead of SVD they directly design transformations that are easier to invert. |
|
QPs are solved by finding the roots (aka zeroes) of the KKT conditions, basically finding points where the derivative is zero. This is done by solving a linear system of equations Ax=b. Warm starting QP solvers try to factorize the matrices in the QP formulation through LU decomposition or any other method. This works well if you have a linear model, but it doesn't if the model changes, because your factorization becomes obsolete.