Hacker News new | ask | show | jobs
by kstrauser 849 days ago
It was meant for the public to see, not to bulk copy it en masse to somewhere else.

Similarly, I don't want my blog posts used to train LLMs. I know they're likely to be since they're published right there on the Internet for anyone to see and read. But my intent was for other humans to see and read them, not for someone to feed them into a regurgitator. There aren't technical means that let me allow humans to read my stuff without allowing LLMs to ingest it, and someone could make the (bad) case that if I didn't want my work to be used to train an LLM, I shouldn't have made it public. Maybe. However, I reserve the right to think someone's an ass for doing it.

Well, no technical hurdles kept the person from copying data out of the network people meant to post it to. It's probably not illegal. It's not a nice thing to do, though.

1 comments

> It was meant for the public to see, not to bulk copy it en masse to somewhere else.

Except literally the entire design is for other Mastodon servers to bulk copy it en masse to somewhere else.

> There aren't technical means that let me allow humans to read my stuff without allowing LLMs to ingest it

Yes there are. Don't make it public.

> However, I reserve the right to think someone's an ass for doing it.

Of course! You can think anyone is an ass. You can think anything you want. That doesn't mean that person did anything wrong.