Hacker News new | ask | show | jobs
by vintermann 769 days ago
The insidious thing is that quite possibly, it "feels" there should be a couple of bugs like that because all the codebases it was trained on had a few.

It might even take the context of the typos in your code comments, and conclude "yeah, this easy to miss subtle error feels right about here".

1 comments

That is a problem but thankfully there is a lot of attention on training with highly curated high quality data right now because it is a known problem. Buggy code is still valuable training data if you use it as part of a question and evaluate the response against a corrected version of the code when training the model to perform a task like bug fixing.