|
|
|
|
|
by sigbottle
22 days ago
|
|
What would unsupervised mean, would unsupervised be something like alphago playing against itself trillions of times? Whereas self-supervised, allows learning without explicit annotation of data ; but it doesn't matter if the models already trained on the entire Internet, and it's not like a game where it can come up with effectively new training data for itself? |
|
Unsupervised is something where there is no intrinsic reward signal. In pre training, predicting the next token and seeing that it matches is a reward signal, hence it is self supervised.