Hacker News new | ask | show | jobs
by amelius 978 days ago
Regarding the vanishing gradient problem, has anyone tried to train using only a randomly chosen set of independent parameters in each iteration? (Updating only the weights in a small random independent set).
1 comments