| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by zerobees 3 hours ago

This doesn't at all change the irony of big AI labs complaining about Chinese startups stealing the labs' IP, essentially by scraping the responses.

HN has a higher proportion of AI promoters than AI skeptics, and for a good while, the default response to complaints from book authors, bloggers, and other content creators was that "you put it on the internet so it's fair game", or "it's no different from a human learning from your works". So yeah, unless we're willing to revise these answers, I think the same "tough luck" reasoning should apply here.

For folks who are at Anthropic, OpenAI, xAI, or Google, and think it's fundamentally different, I would ask you to think long and hard about that answer.

1 comments

setopt 3 hours ago

Completely agreed. I would go further and say that it should be legal to scrape responses from LLMs to train new LLMs, and that forbidding that in your ToS should be considered an illegal contract. That’s simply the best way to avoid complete monopolization of the space, without requiring more drastic measures like antitrust down the line (which we seem to not manage well these days, given the number of monopolies). As long as you pay for your tokens like anyone else, "Big LLM" shouldn’t be allowed to control what you use the output for.