Hacker News new | ask | show | jobs
by Havoc 1444 days ago
I don't think google's recent troubles are a result of index size.

Pretty sure G could throw more money and resources at it if they thought that would make a dent.

It feels more like a lot of real content is collateral damage to the SEO vs google wars. e.g. A blogger was complaining the other day that someone had set up an automation to automatically scrape their content the second it gets published, run it through translate twice and publish the resulting semi-gibberish.

Those sort of shenanigans are I suspect quite hard to deal with even if you're google

1 comments

It’s only going to get worse with the proliferation of NLP nets a la GPT.

Will be interesting to see a decade from now how researchers collect a corpus that isn’t chock full of a model’s own output