Hacker News new | ask | show | jobs
by shireboy 1132 days ago
> “If GitHub can produce code by training an AI on all code it is hosting, Youtube could produce videos and music by training an AI on all content it is hosting, the Writer Guilds could produce books by training an AI on all books it owns the rights for, Shutterstock could produce more stock images by training an AI on all stock images it is hosting.”

There is a subtle difference here. Microsoft isn’t just producing code based on GitHub data. They are producing a tool that lets others generate code based on GitHub data. I do think consideration of the source data creators intent is important- and there is a case CoPilot hasn’t done that. But if Shutterstock wants to use any images _that they have been given license for and treat creators fairly for_ to build a tool that lets others generate images, they should be allowed.

Also, the op argues only MS has access to train based on all of GitHub. Others might run into rate limiting etc. However we know Amazon and others do have similar models. This would indicate MS may have a competitive edge but not a full market lockout.