|
|
|
|
|
by namaria
438 days ago
|
|
> "they can only produce code that's almost identical to what's in their training data" Again, you're misinterpreting in a way that seems like you are reacting to the perception that someone attacked some of your core beliefs rather than considering what I am saying and conversing about that. I never even used the words "exact same thing" or "almost identical". Not even synonyms. I just said overfitting and quoted from an OpenAI/Anthropic paper that said "predict plausible changes to code from examples of changes" Think about that. Don't react, think. Why do you equate overfitting and plausibility prediction with "exact" and "identical". It very obviously is not what I said. What I am getting at is that a cannon will kill the mosquito. But drawing a fly swatter in the cannonball and saying the plastic ones are obsolete now would be in bad faith. No need to say to someone pointing that out that they are claiming that the cannon can only fire on mosquitoes that have been swatted before. |
|
Reading back, you said:
> I often see people wondering if the some coding task is performed well or not because of availability of code examples in the training data. It's way worse than that. It's overfitting to diffs it was trained on.
I'll be honest: I don't understand what you mean by "overfitting to diffs it was trained on" there.
Maybe I don't understand what "overfitting" means in this context?
(I'm afraid I didn't understand your cannon / fly swatter analogy either.)