Hacker News new | ask | show | jobs
by viraptor 704 days ago
> would be training with properties they own or have licensed appropriately

Cool - you want to train on medical data? That will be $1M per paper per day to Elsevier for a licence. (Apply similar for movies, news, books, etc.) Ensuring no knowledge remains "common".

The moment they can, the big data holders will charge whatever they can get away with. For an average person that may be even worse than the current content stealing.

1 comments

Papers are ridiculously low signal-to-noise when it comes to knowledge. (see https://markusstrasser.org/extracting-knowledge-from-literat... ... but sure, LLMs might have a better extraction rate)

And, importantly if papers become so valuable ... authors will suddenly start to want their cut and will go to the publisher that pays the more. And so on.

It would be amazing if papers would worth that much (as it would mean they can help creating at least that much value downstream), and it would mean there would be a lot of money in writing new papers. (Oh imagine that flood of low quality shit! Oh no, it's almost the same as now! Ehhh.)