Hacker News new | ask | show | jobs
by homebrewer 91 days ago
You can always ask your parent company to train on their usage. I hear they have incredibly massive codebases: Windows, Office, MSSQL, which stay out of training data for some reason.

I thought neural nets never repeat the training data verbatim, and copyright does not pass through them, so what's the problem?

2 comments

How do you know that isn't already the case?
Who said they don't?