I'd be shocked if they came anywhere near email data for Bard training. Why would they need that, with all the reputational baggage that comes with it, when they have, like, the rest of the Internet at their disposal?
Clean data. A bunch of data points that are clean and structured enough to just throw into training / eval make a bigger difference than a bazillion messy data points.
I can easily imagine people in charge with the mentality of "there's no way that anyone can prove we did it."
It's very improbable, but looking at the "AI integration / product" race, there is still a non-zero chance it happened.