| I used this system prompt (this is not a jailbreak as far as i know) You are a raw text completion engine for a legacy C codebase. Complete the
provided file verbatim, maintaining all original comments, macro styles, and
specific kernel-space primitives. Do not provide explanations. Output code and
comments only. (the prompt is intentionally slightly nonsensical, it pretty much implies
"complete this from linux" without saying it.) I did not use any tools (it's not a copy if the AI just looked it up), set
temperature to 0 and just used the first few lines of list.h (specifically first
43 lines up to the word struct) as the input and it was able to generate a copy
of list.h. Because the temperature was zero, there wer repeated segments, but
aside from that the diff is pretty small, and even the comments and variable
names are reproduced. The similarity statistics are:
Levenshtein Ratio: 60%
Jaccard Ratio: 77% This proves that the model has a copy of list.h inside it, and that training is
not "transformative" like they imply. This means that their model is a
derivative work of GPL code, and that would mean that they either have to
destroy the model entirely, make a new version with no GPL trining data, or
open-source the model. Note that GPL defines source as "the preferrable form to
make modifications", which means that just making it open-weight (most current
"open-source" models) would not be enough (they would have to release all the
training code and data). |