|
|
|
|
|
by anonymouskimmer
1226 days ago
|
|
> You can't strap a source-crediting mechanism on top of a transformers-based model after the fact. I've read that ChatGPT is not connected to the net, but if it was: Couldn't you have it do a google search (or better yet corpus search) for the string it generated and then return the most significant matches (significance by string matching, not google rank)? It would be really crude, but wouldn't this just be a handful of lines of code that don't interfere with the "transformers-based model" code at all? |
|
The other day I had GPT write a rap battle between Burger King and Ronald McDonald. One of the stanzas came back:
It turns out that yes, Ronald McDonald was first introduced in 1963. https://en.wikipedia.org/wiki/File:McDonald%27s_commercial_(... (from https://en.wikipedia.org/wiki/Willard_Scott#Created_Ronald_M... )So here's the challenge for you - who do you compensate for that line?
The complaint that people have isn't that GPT isn't citing its sources but rather that it isn't compensating the people who created the data that has that information.
... and now, if you're ever asked about historical clown trivia and pull out the "Ronald has been around since 1963", who should you give a royalty to? Me (for writing this), GPT (for making me aware of it), Wikipedia (for the source of my links in this post), the estate of Willard Scott for the Joy of Living (which Wikipedia cites), some random blog author that had some clown trivia on it that happened to have been part of the training set for GPT?