Hacker News new | ask | show | jobs
by dhimes 5630 days ago
more and more searches are using ever increasing numbers of search term words, and that can get spammy

Why is that? I thought more words would give more focussed, less spammy results.

On another point it seems to me that Google would have to tread carefully here if the set of spammers has a large intersection with the set of folks who buy advertising from Google. I would (however naively) suspect this might be the case.

1 comments

> Why is that? I thought more words would give more focussed, less spammy results.

I've noticed that if you use too many words, you more often end up with keyword-stuffed or aggregated/scraped pages, because they just happen to use all the words in your search query on the same page. Sometimes this is because there aren't actually any real results for the particular narrow niche you wanted, but sometimes it's just because none of the real results use all of the keywords you tried verbatim, and Google wasn't able to figure out that those non-verbatim matches were more relevant than the exact-match-but-crap pages (admittedly a hard problem).

Ah, I see what you mean. These link farms are getting clever, and using content "real" enough to pass the automated sniff test.
Part is also what counts as spam. Google doesn't count a lot of index-type pages as spam, so if you search for a conjunction of a few programming terms, [scala foo bar baz], you often get a page that indexes blogs on a topic, in this case Scala. The post titles on that page will use all of your search terms, but often in different post titles, effectively erasing the conjunction operator in your query.