by Matticus_Rex
904 days ago
It turns out you can reproduce articles with next-token prediction when the articles are quoted all over the dataset. The articles themselves are indisputably not part of the model, because it doesn't store text at all. OpenAI's position is correct; people just underestimated how well the AI learns from reading, especially when it reads the same text in many different places because it's being quoted or excerpted.
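To make the duplication effect concrete, here is a toy sketch. It is an assumption-heavy stand-in, not how any real model works: a word-level count table (`train`) plus greedy decoding (`generate`), with a made-up `article` and made-up `docs` that quote it. The only point is that when the same passage appears in several documents, picking the most frequent next word walks straight through the passage.

```python
from collections import Counter, defaultdict

def train(docs, n=2):
    """Count which word follows each n-word context across all documents."""
    counts = defaultdict(Counter)
    for doc in docs:
        words = doc.split()
        for i in range(len(words) - n):
            counts[tuple(words[i:i + n])][words[i + n]] += 1
    return counts

def generate(counts, prompt, max_words, n=2):
    """Greedy next-word prediction: always take the most frequent continuation."""
    out = prompt.split()
    for _ in range(max_words):
        followers = counts.get(tuple(out[-n:]))
        if not followers:
            break  # context never seen in training
        out.append(followers.most_common(1)[0][0])
    return " ".join(out)

# Hypothetical passage, quoted in three documents with different framing.
article = "the council voted on tuesday to approve the new harbor bridge"
docs = [
    "a blogger wrote that " + article + " and readers agreed",
    "as the paper reported " + article + " despite protests",
    "quote of the day " + article,
    # A similar but different sentence that appears only once.
    "the council voted on monday to delay the old road budget",
]

model = train(docs)
# Ask for exactly as many words as the passage has beyond the two-word prompt.
result = generate(model, "the council", max_words=len(article.split()) - 2)
print(result)
print(result == article)
```

After "voted on", the table has seen "tuesday" three times and "monday" once, so the greedy choice follows the quoted passage rather than the one-off sentence. A neural network is obviously far more complicated than a count table, so treat this only as an illustration of why repetition in the data matters.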