Hacker News new | ask | show | jobs
by rmwaite 978 days ago
Isn’t it worth considering that the reason ChatGPT can do those things is that it was trained on data from platforms like Stack Overflow? This is hard to quantify but my guess would be that without the SO data it wouldn’t be as useful.
2 comments

You are correct.

This is why reddit and twitter both locked down their APIs, due to the data being very high quality and immensely valuable for training.

Too bad they all did it too late, no one saw ChatGPT coming. And since all the data was already scraped from Stackoverflow, it no longer has any value for OpenAI. Stackoverflow is rapidly declining in volume, so future data from it is irrelevant.

It is not worth considering that because that is irrelevant to the fact that ChatGPT is a competitor now.