|
|
|
|
|
by huahaiy
1676 days ago
|
|
You either has or has no guarantee, that's what the word "guarantee" means. Don't mince words. You had a wrong understanding. Now you are corrected. Let's move on. Nobody said anything about "fewer term matches could not be more relevant". You are just making up straw men here. That's not the discussion we are having. What I said is this, from a user point of view, it's not good to have a document containing fewer query terms to rank higher. This is a fact that even Lucene acknowledge (at least when they were version 3.5.0.). You have nothing to counter this fact. |
|
There’s been extensive research justification behind the vector-space model. BM25 is the 25th iteration of a model and well tuned BM25 holds the highest non nueral performance on many tasks including question answering[1]. Research has long found including factors other than total term matches matters. Such as IDF[2] and field length[3].
Have you benchmarked your relevance assumptions similarly? If so I’d love to see them and learn more!
1 - https://www.elastic.co/blog/improving-search-relevance-with-...
2- https://www.researchgate.net/publication/238123710_Understan...
3 - http://sifaka.cs.uiuc.edu/course/410s12/mir.pdf