Hacker News new | ask | show | jobs
by lars 971 days ago
I basically share your sentiment. However, Greg Yangs work seems to have produced something of direct practical benefit for training large neural nets, based on the NTK literature. ยต-Parametrization is apparently very useful in real world practice.