Hacker News
by winstonwinston 55 days ago
> There's nothing anyone can do about it, but the suspicion is that the big companies have taken everyone's code on GitHub, without consent, and trained on it.

I asked agent X what training data its generated code came from, and it couldn’t say. Then I asked why its implementation is exactly the same as the output of agent Y. It said they were both trained on the same ‘high-quality library’, but it still couldn’t say which one.

So I guess that’s fine because everyone is doing it.

1 comment

You asked a machine that makes things up when it doesn't know the answer a question that it has no way of knowing the answer to. I don't know why you bothered to relay its response.
It's software, not a machine. The comment is relevant to the suspicion that THE software is using (distributing) some OSS code without attribution.
> It's software, not a machine.

It's both and I don't see how that matters.

> The comment is relevant to the suspicion that THE software is using (distributing) some OSS code without attribution.

The accusations in the comment are relevant.

Framing it as a conversation with an LLM and showing its responses, when that LLM does not have access to the answer and is fully making up a response, is irrelevant and distracting.

I see your point.