by ithkuil 1177 days ago
> By indexing and training on everything it can find in the <PUBLIC> internet?!

and that's bad because?

I would see the point if they were training on my private data I entrusted to somebody and they illegally obtained it without my permission. Are they doing that?

See my edit: They will ignore licensing information and train on the data, possibly including privacy related information, without any respect for either.

See this: https://news.ycombinator.com/item?id=32573523

What kind of "privacy related information"? This is data on the open internet!