Hacker News new | ask | show | jobs
by _0ffh 3693 days ago
"The old argument was that unsupervised pretraining helps get proper weights faster, but this has largely been disproven."

Do you hold that to be true in general, or only when using dropout?