by PaulHoule
1875 days ago
Ranking 'quality' is tough because it has multiple dimensions, and aggregating multiple kinds of utility is problematic; see https://en.wikipedia.org/wiki/Arrow%27s_impossibility_theore... Developing a good ranking function for bag-of-words full-text search is about carefully balancing the attraction of larger vs. smaller documents in the collection. If you try to aggregate several bad search engines, you get a bad search engine; if you try to add more factors, you get a bad search engine. Instead, you have to walk the path of BM25 or one of the more modern 'information theoretic' ranking functions. (That's precious knowledge I got from years of reading conference proceedings and still having no idea how to make a good ranking function, then finally reading the right review paper, which pointed out the two discoveries the conference made in its first 10 years!)
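
To make the "larger vs. smaller documents" balance concrete, here is a minimal sketch of Okapi BM25 in Python. It is illustrative only and not taken from any particular engine: the function name `bm25_scores`, the defaults `k1=1.2` and `b=0.75`, and the non-negative IDF variant (with `1 +` inside the log) are my assumptions. Documents and the query are assumed to be pre-tokenized lists of strings, and the collection is assumed to be non-empty.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document against a tokenized query (BM25 sketch).

    k1 controls term-frequency saturation; b controls how strongly
    document length is normalized (0 = ignore length, 1 = full normalization).
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs

    # Document frequency: number of documents containing each term.
    df = Counter()
    for d in docs:
        df.update(set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Non-negative IDF variant (an assumption of this sketch).
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Length normalization: longer-than-average docs are penalized.
            norm = k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + norm)
        scores.append(score)
    return scores


# One short and one long document, each mentioning "ranking" once.
docs = [["bm25", "ranking"], ["bm25", "ranking"] + ["filler"] * 50]
with_norm = bm25_scores(["ranking"], docs)            # short doc scores higher
without_norm = bm25_scores(["ranking"], docs, b=0.0)  # both docs score the same
```

The `b` parameter is the knob for that balance: at `b=0` a single hit counts the same in a two-word document as in a fifty-word one, and raising it increasingly favors the shorter document.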