OpenAI definitely has used input tokens to further train its models, but Anthropic has emphatically stated they do no such thing. I have trusted them so far on that. Are you saying they're lying?
I'm not going against any explicit policy or promise to customers that a particular AI company might make, but rather what is and can be happening that a lot of the public doesn't realize in general. A lot of what is attributed to AI, can be the work of humans (including customers), that in various cases were or arguably being ripped-off. Speaking of which, there are lots of cases of companies claiming to use or have an AI product, but instead were just using humans for low pay (but wasn't previously referring to that).
In the Tom and Bill shell game example given, where they are being used for their code and to correct code that is sold to other customers, it's not a "now" thing either. Meaning Tom, Bill, and the other customers don't have to be exchanging code in real time, when that code is being uploaded, saved, and trained on by AI companies. Tom could have worked on some code a month ago, that was slurped up from Susan. Tom fixed many of the errors of Susan's code, which is now fed to Bill, when he inputs the correct prompts. Bill thinks the AI is the "genius", but is unknowingly benefiting from Bill's and Susan's work, review, and corrections. Potentially more devastating to Bill, is what he may mistakenly think was private or secret to only him, is fed to other customers for profit.
AI and their companies are also connecting people, in that indirect black box way, where those people may not realize they are connected, being fed, and are correcting each others code. Yeah, some may not care where the code comes from or how, but that they can use it for their personal purposes. Sure, that's not the only part of the story and LLMs are doing some interesting and amazing things, but there is another part of that story that is not being more widely acknowledged. In a similar way in which has angered so many artists and authors, where they feel aggrieved and taken advantage of; relative to many art, song, and book lawsuits.
In the Tom and Bill shell game example given, where they are being used for their code and to correct code that is sold to other customers, it's not a "now" thing either. Meaning Tom, Bill, and the other customers don't have to be exchanging code in real time, when that code is being uploaded, saved, and trained on by AI companies. Tom could have worked on some code a month ago, that was slurped up from Susan. Tom fixed many of the errors of Susan's code, which is now fed to Bill, when he inputs the correct prompts. Bill thinks the AI is the "genius", but is unknowingly benefiting from Bill's and Susan's work, review, and corrections. Potentially more devastating to Bill, is what he may mistakenly think was private or secret to only him, is fed to other customers for profit.
AI and their companies are also connecting people, in that indirect black box way, where those people may not realize they are connected, being fed, and are correcting each others code. Yeah, some may not care where the code comes from or how, but that they can use it for their personal purposes. Sure, that's not the only part of the story and LLMs are doing some interesting and amazing things, but there is another part of that story that is not being more widely acknowledged. In a similar way in which has angered so many artists and authors, where they feel aggrieved and taken advantage of; relative to many art, song, and book lawsuits.