Hacker News new | ask | show | jobs
by i_have_an_idea 251 days ago
> I've consistently found Gemini to be better than ChatGPT [ because ] Google has crawled the internet so they have more data to work with.

This commonly expressed non-sequitur needs to die.

First of all, all of the big AI labs have crawled the internet. That's not a special advantage to Google.

Second, that's not even how modern LLMs are trained. That stopped with GPT-4. Now a lot more attention is paid to the quality of the training data. Intuitively, this makes sense. If you train the model on a lot of garbage examples, it will generate output of similar quality.

So, no, Google's crawling prowess has little to do with how good Gemini can be.

1 comments

> Now a lot more attention is paid to the quality of the training data.

I wonder if Google's got some tricks up their sleeves after their decades of having to tease signal from the cacophony of noise that the internet has become.

if the quality of search results today is anything to go buy -- clearly no
Google's search is finely tuned to push you into clicking the link of who pays them the most. The search results are excellent quality for their customers. Your mistake is thinking you are the customer.