by lairv
1258 days ago
I feel like people on HN hold different opinions about AI training when their own work is concerned (code, blogs) than when others' work is concerned (art). I've seen posts saying it was normal for AI to train on scraped art data, but taking offense when AI trains on their GitHub code.
My opinion is that if something is publicly available on the internet, then an AI should be allowed to train on it.
The AI doesn't grok the code, it just copies it. This is fine for art, because accuracy isn't required. It's not fine for code, because code needs to be accurate.
Generating code using an AI is going to lead to vulnerabilities which were either present in the original training code, or have been created by mis-applying training code.
Granted, junior devs (hell, even senior devs) can and will make the same mistakes, but at least someone understands the code and can fix the vulnerability relatively easily once it has been exposed. The AI doesn't understand that it made a mistake, and has no idea how to fix it.