| Is there any evidence of Copilot producing substantial (100s of lines) verbatim copies of copyrighted works? Absent this, I don't think there's a case. The courts have given extraordinarily wide latitude to fair use and ML algorithms are routinely trained on copyrighted works, photos, etc. without a license. I understand that this feels more personal because it involves our field, but artists and authors have expressed the same sentiment when neural nets began making pictures and sentences. The question here is no different than "Is GPT-3 an unlicensed, unlawfully created derivative work of millions, if not billions of people?" No, I'm quite confident it is not. |
It doesn't need to be substantial. In Google v. Oracle a 9-line function was found to be infringing.