Hacker News new | ask | show | jobs
by ykevinator2 1668 days ago
Its a great idea and you should keep going with this (and nice work so far). You may want to consider the idea of only presenting disambiguating tags. A simple heuristic like finding the top x tags of close to equal cardinality and throw the rest away. The value of what you've built is disambiguation and if you present every single tag, most of those tags are noise. Hope this helps.
1 comments

> “disambiguating”

Interesting to think about.

Something like—High frequency tags may be present in the name or as synonymous with the search term, whereas less common/frequency terms, which would characterize the link, would not appear.

I’ve not done any work on this topic but it’s very interesting.

I’m thinking about search on e-commerce sites (lol. holiday shopping) where you’re searching on a general term or type of product. The results are mixed, so you sort by price. Let’s see what a high value item looks like, and a low value; this is an evaluation heuristic to reveal different quality products.

Maybe there should be both tags then? A few high frequency and a few low frequency?