|
|
|
|
|
by inertiatic
2283 days ago
|
|
A lucene based solution would probably scale better and would allow you to implement more complex behavior. You get analyzers, stopword filters, synonyms etc. out of the box, and you can express things like "virus within 5 words of lung" or "covid or virus but covid is way more important".
I believe given the rather small dataset and the fact that you won't even expose more complex queries on your interface, the main benefits are the analyzers and various filters you can use. If you are already doing stemming, stopword filtering and maybe synonyms you're probably fine. (My email is in my profile if you want to discuss this in further detail) |
|