|
|
|
|
|
by IgorPartola
1553 days ago
|
|
I still don’t get why. It cannot be so difficult for them to keep things like literal search, can it? What is the incentive to remove it and replace it with a needlessly more complex almost literal but still fuzzy search? I do suspect the main thing people complain about currently with Google is the abundance of ads and the algorithm that has encouraged stupid amounts of articles of a certain length. Recipe for baked potatoes is now 2000 words long. |
|
Greater scale = greater cost of keeping data hot in their search data-warehouses (esp. in light of contention over memory/caches.) Keeping around both a source-text string and its tsvector representation (or whatever Google's version of that is) is a "thing that doesn't scale" that they could provide at 1B queries/day, but probably not at 10B queries/day.
> the algorithm that has encouraged stupid amounts of articles of a certain length. Recipe for baked potatoes is now 2000 words long.
That's not the algorithm's fault per se; that's instead the fact that recipes can't be copyrighted, and so these sites can freely steal + repost one-another's recipes, and so you'll find the same recipe word-for-word on many sites, thus making an exact match in the recipe part not contribute highly to ranking any particular site. The 2000-word blog post, on the other hand, is actual Intellectual Property unique to the site posting it. So it only appears in the one place; and so when your query matches it, it ranks quite highly indeed.