| Why couldn't you, as a human do that to verify it? The other day I had GPT write a rap battle between Burger King and Ronald McDonald. One of the stanzas came back: Burger King:
Your burgers are plain, your buns a bore.
Your clown's been around since '63,
I'm sure my flame-grilled taste will leave you impressed
My burgers are fresh, my fries are the best
It turns out that yes, Ronald McDonald was first introduced in 1963. https://en.wikipedia.org/wiki/File:McDonald%27s_commercial_(... (from https://en.wikipedia.org/wiki/Willard_Scott#Created_Ronald_M... )So here's the challenge for you - who do you compensate for that line? The complaint that people have isn't that GPT isn't citing its sources but rather that it isn't compensating the people who created the data that has that information. ... and now, if you're ever asked about historical clown trivia and pull out the "Ronald has been around since 1963", who should you give a royalty to? Me (for writing this), GPT (for making me aware of it), Wikipedia (for the source of my links in this post), the estate of Willard Scott for the Joy of Living (which Wikipedia cites), some random blog author that had some clown trivia on it that happened to have been part of the training set for GPT? |
It isn't just monetary compensation that's important here.
I come at this from the point of view of a scientist who is expected to reference ideas. Not necessarily back to their original source, but at least back to a source that can theoretically point back to another link in the chain.
Sure, I can manually search for a reference based on what ChatGPT gave me. Or someone could spend a few minutes adding a few lines of code to ChatGPT to save millions of people some minutes of time.
-----
What would be awesome is an LLM that you can feed data to, and it can then write a paper based solely on the data you feed it.