| Hi all. I would like your help with an ethics problem. I run a site called The Nose, a safe haven for AI training data. It operates overseas in a region out of reach of DMCAs. (Past info: https://news.ycombinator.com/item?id=37512147) This was necessary because I felt it was unacceptable for entire datasets to be forced offline by one lawyer. The ethical problem is that I'm sympathetic with people who want to remove their content from AI training data. I received an email from the Danish Rights Alliance about Books3: https://pastebin.com/6qw3yMWZ They point out that this is illegal in Denmark and elsewhere, and threaten to ban thenose.cc from Denmark. Obviously, the threats are meaningless. But I'm interested in your views on whether we should comply with the request by removing the specific titles they list. I was thinking of saying "If you say 'please', I will remove the listed titles." There are 109 entries, so it wouldn't be too much hassle to just remove those from the tarball, and it would be amusing to force a lawyer to ask nicely. For now, I asked for a complete list of the full filenames they want to be removed, along with proof that they represent the listed rightsholders. I'm more interested in how you feel. It seems reasonable to let people opt out of training. We could formalize this process by setting up a way to do this. We could also just ignore takedown demands. What do you think? |
Suspect this will be unpopular, but..
Information wants (and deserves) to be free. People make sophisticated and convincing arguments for incentivizing creation, they resonate with me but ultimately I just do not agree with them.
I think projects like yours are on the right side of history, but it will take a long while before we collectively agree.
Your ethical dilemma hinges on whether or not you agree with the above.