Hacker News new | ask | show | jobs
by fjkdlsjflkds 1296 days ago
No. You got the right impression. It is indeed doing "next token prediction" in an autoregressive way, over and over again.

The best source would be the GPT-3 paper itself: https://paperswithcode.com/method/gpt-3