|
|
|
|
|
by gav
2675 days ago
|
|
I call this the "massive heterogeneous catalog" problem. Generally e-commerce retailers have grown from a fairly narrow set of product categories (e.g. books for Amazon) to adding more and more diverse categories. This has a dramatic impact on site search quality. If you consider a simple example, shoes. You only need a couple of facets to filter products to a reasonable set to browse through: gender and size. Now start adding accessories, athletic clothing, and so on, and the results end up getting harder to navigate with generic search terms like "shoe" giving less relevant results (not having the context of the user's intent hurts here). I tried "shoe" on Amazon, got over 400,000 results with the first item being a shoehorn. It takes a bunch of clicks to deal with that. This search problem gets worse as catalog sizes grow even bigger. Personalized results help a lot and Amazon seem to fail me with this, they don't do a good job bubbling up the products I buy to the top. It's a hard problem to solve but it's not going to kill Amazon. |
|
Systems like Solr, elastic search and endeca (out of the box) all assume relevance means keyword frequency in a product page, with some weighting depending of title, description, tag, etc. Delivering relevant results that users might want to purchase requires taking these systems, adding or customizing their NLP techniques, operationalizing historical user search & purchase data to determine intent, personalizing by shopper history, etc.
The challenges of massive heterogenous catalog affect other areas... Chief among them search result personalization… an individual’s gaming purchase history might cause ‘button down’ to return gaming keyboards, rather than oxford shirts, while a pet products purchase history could lead to a search for turkey returning turkey dog food.
The fact that Amazon fails to personalize search results is evidence of the difficulty & opportunity here. The sort of pervasive personalization found in AirBnb, facebook, google are simply out of reach of most ecommerce retailers…