|
|
|
|
|
by heliophobicdude
1023 days ago
|
|
The biggest distinction in architecture between BERT and GPT is that BERT looks both ways from a given token. This helps give context to a token. This is what made BERT great at the time because the surrounding text, before and after, could change the meaning of the token we are at. You could essentially fill in the middle, or rather correct what's in the middle after it's been said. I believe this is why Apple is using it for iOS 17's auto-correct. GPT predicts the next word by only look back at what we have seen so far. In other words, it's auto regressive. |
|