Hacker News new | ask | show | jobs
by ladon86 1173 days ago
> Google denies doing it

Read their statement carefully and it's actually not a denial of the allegation.

> But Google is firmly and clearly denying the data was used: “Bard is not trained on any data from ShareGPT or ChatGPT,” spokesperson Chris Pappas tells The Verge

* Allegation: Google used ShareGPT to train Bard.

* Rebuttal: The current production version of Bard is not trained on ShareGPT data

Both things can be true:

* Google did use ShareGPT to train Bard

* Bard is not currently trained on any data from ShareGPT or ChatGPT.

It depends on what the meaning of is is ;)

2 comments

Intent matters I guess.

Did they accidentally train on that public piece of info they scraped anyway because they are scraping the whole web?

Or did they intentionally scrape chatgpt output to see if that would help?

They could have trained, then modified code, repeat, to better enhance training in the current version.

Then after, train on raw data.

Trained would mean the current model wasn't trained at all from ShareGPT data, not that was trained on it previously, and isn't being trained anymore.

This association makes no sense.