by canjobear 2194 days ago
The pretraining approach was used in vision for years before it was successful in NLP.
1 comment

Not really on unsupervised/self-supervised data though, right?

(nor on the same scale of corpora, as far as I can tell)