| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Springtime 21 days ago
	There's evidence various third-party models (including Deepseek) used distilling in training, based on models from those leading services. So they have more flexibility with pricing.

2 comments

malnourish 21 days ago

Is that fundamentally any different than what e.g., Meta and OpenAI have done?

Besides, hasn't SCotUS ruled that raw LLM output isn't subject to copyright? So these companies would be breaking a ToS at worst.

link

behnamoh 21 days ago

So? And Anthropic/OpenAI literally stole copyrighted content to train their models.

link

Springtime 21 days ago

The point was that distilling based on others' models for training means they're not spending the same amount on R&D and/or training, giving them headroom in other ways (responding to the parent's point). It wasn't a comment reflecting on copyright/fair use.

link

behnamoh 21 days ago

In the same fashion, Anthropic/OpenAI also reduced their training cost by not purchasing the license to copyrighted work and stealing it instead.

link