Hacker News new | ask | show | jobs
by jjeaff 1276 days ago
One avenue that chatGPT has, and I'm not sure if it is being utilized at all yet, would be the ability to feed it the unimaginably huge body of information locked behind copyrighted textbooks, books, academic papers and other pay walled information.

Imagine the knowledge that could be accessed by feeding all that information into an ai engine like chat gpt. Presumably, it would not break copyright rules anymore than a regular human reading a bunch of papers behind a paywall and regurgitating the learned information.

2 comments

Its not stealing your writings its sharing the conclusion in its own words like you can legally do a book review.

It wont be just like search as we burn all the books (websites) on the www. Each website needs a human to pay for it and to lure at least some traffic to it.

This preservation after death has its downsides too. It will turn into the steroids v of stack overflow outdated answers.

It will be (or already is) brilliant at combining things writing humanly readable code in your favorite language and bake the same in lower ones that are perhaps oddly stable.

So I guess we will finally be able to do away with all popular languages and use them for pseudocode only⸮

Why would copyright holders let that happen? Also, how is the model trainer getting access to all of this material?
Google Books is a large corpus that at least Google has access to.