by rmwaite
978 days ago
Isn’t it worth considering that the reason ChatGPT can do those things is that it was trained on data from platforms like Stack Overflow? This is hard to quantify, but my guess would be that without the SO data it wouldn’t be as useful.

This is why Reddit and Twitter both locked down their APIs: the data is very high quality and immensely valuable for training.
Too bad they all did it too late; no one saw ChatGPT coming. And since all the data had already been scraped from Stack Overflow, the site no longer has any value for OpenAI. Stack Overflow is rapidly declining in volume, so future data from it is irrelevant.