Hacker News new | ask | show | jobs
by martinwoodward 91 days ago
It wasn’t previously opt-in.

Previously we didn’t do any training on usage. However as other products have come into the market they do train on usage. We’ve been training on our internal usage for just over a year and have seen some major improvements. For details see of the types of improvements we’ve seen from training on our internal usage check out this article: https://github.blog/news-insights/product-news/copilot-new-e...

2 comments

You can always ask your parent company to train on their usage. I hear they have incredibly massive codebases: Windows, Office, MSSQL, which stay out of training data for some reason.

I thought neural nets never repeat the training data verbatim, and copyright does not pass through them, so what's the problem?

How do you know that isn't already the case?
Who said they don't?
This seems reasonable, maybe too much so.

> If they want to incentivise people to contribute their sources and copilot sessions, they could easily make it opt-in on a per-repository basis and provide some incentive, like an increased token quota.