by prewett 73 days ago
Even worse, the training set probably includes a lot of code that needed review but didn't get it...

If we know the outcome of that code (whether it caused bugs, data corruption, a crappy UX, or tech debt), which is potentially available in subsequent PR and commit messages, it's still valuable training data.

Probably even more valuable than code that just worked, because evidently we have enough of that and AI code still has issues.