Hacker News new | ask | show | jobs
by paulryanrogers 1791 days ago
Training is one thing. Regurgitating chunks verbatim without attribution is another.
1 comments

In general taking short excerpts of a copyrighted work is legal and is not infringement.
Try lifting a riff from a Metallica song and see how far you can get selling it commerically.

Also, Copilot is copying much more than short excerpts, going as far as to reproduce large amounts of copyrighted code verbatim[1].

[1] https://twitter.com/mitsuhiko/status/1410886329924194309

If the sample of the Metallica song is insubstantial enough then you may well prevail in court.

It's unsurprising that copilot can reproduce the most famous subroutine of all time precisely, given that it occurs in hundreds or thousands of repos.

Also that code is not copyrightable. Pure algorithms are not copyrightable, copyright of code arises from its literary qualities.

E.g. I can copy an algorithm out of an ISO spec and that doesn't make my code a derivative work of the spec requiring me to pay royalties to ISO.

When you strip out the algorithmic elements out of fast inverse sqrt, you are left with what? Single letter variable names. That is certainly far below the threshold for copyright.