by luke-stanley
700 days ago
"Stack Overflow is no longer uploading the data dump to archive.org."
"We would really rather users do not upload the file to archive.org or similar data pile sites."
They have no way to stop people from doing that under the license, only kind words. Since they've deliberately made the data hard for people to train on, I'd be really surprised if people didn't put it on Archive.org and HuggingFace Datasets. So long as it's redistributed under the license, it should be fine, right?
I am not a lawyer.
What they said about access speed issues makes little sense to me. I torrented their dumps before just fine and was very happy to seed them.