Hacker News new | ask | show | jobs
by kelnos 1001 days ago
> yet another stackoverflow copy

I never got why these even ever appear in Google search results (or any search results, really). It feels like it would be super trivial to identify sites that are scraped copies of other sites. Granted, without foreknowledge, the engine doesn't know which is the original. But at the very least this can be determined by a human once, and then the problem goes away forever for that particular site.

4 comments

Maybe that scraped copy leverages doubleclick so its success is aligned with Google interests, sometimes even more than the original website.
That ship has already sailed, they are already using AI in mass to generate original looking content.
Trying to find any guides for anything in Baldur's Gate 3 returns page upon page of AI generated garbage, a sure sign of things to come.
Funny that you mention this game. bg3.wiki, the community wiki had a lot of troubles with SEO. It got ignored or pushed down in the search results for a very long time, while the awful Fextralife wiki that includes a Twitch view botting iframe on every pages was always first.
Sadly even the Fextralife wiki (garbage that it is) is better than most of the other results and that's still drowned out by the AI spam.
At this point it's just safer to treat any content newer than about a year ago as highly suspect. Bots and fake content have been around for years, but things changed when ChatGPT and the copycats went live.
Which is French for "in mass".
The blue ribbon chef was said to be the cream of the cream, so the restaurant owner was happy for him to have white card over the place. He arranged an outside the work of fatty liver, a main course of rooster of wine with eat all, and as the blow of mercy: burned cream; the full menu was a feat of strength! He made sure to wish the diners good appetite. However, when the owner visited from her foot on the ground she turned into a terrible child and demanded mouth amusers and crescents. She hated the decorative objects of art made of chewed paper.

(When we steal from French, we don't translate it to English, it becomes English).

Well, there are loanwords, and there are calques.
it's kind of upsetting that the first to benefit from LLMs are the scum of the internet.
It's googles fault. They are the ones who make this a viable business model. They pay the ads, and they pollute their search results with this garbage.

100% Google who are destroying this part of the internet.

Google gave and google took
The love of money
How can the search engine not able to tell who the original is? Originals always exist earlier, not to mention SO.com domain rank is way higher than those spammed sites that existed for less years.
Even if it wasn't easy to detect SO rip-offs, surely Google engineers see them all the time when they perform searches.