by speedgoose 1125 days ago
It’s perhaps in the training dataset but unless your code is extremely common and duplicated, it’s probably not in the final models. They aren’t that big.

Hmm, I still don't like this argument. Whether there are actual bits of the code in the model or not, his code is still in there somewhere, even if it's just an approximation.

I feel much the same, personally. I've worked hard on open source, and I'll never use the same permissive license again after this.

I asked it to write a function with a given name and it came up with stuff that was in the general ballpark of the software. So yeah, it’s in there.