by PaulHoule
990 days ago
I've used https://sbert.net/. My take is that you might like keyword search better for some queries and embedding search for others. Two problems are both hard: (1) how to combine keyword search and embedding search (you'd imagine you'd want a ranking function that handles both), and (2) how to handle chunks. As for (2), you probably want to make the chunks as big as you practically can, and you should chunk on tokens instead of characters if you possibly can. With chunks, of course, you don't get a score for the query-document relationship; you get a query-chunk score instead, which isn't quite the score you really want. Aggregating all the chunk hits and properly chunking the data is an open problem, to say the least.
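To make the chunking point concrete, here is a minimal sketch, not a recommendation. It assumes the text has already been turned into a list of tokens (a whitespace split stands in for a real model tokenizer here) and that some retriever has already produced (doc_id, chunk_score) pairs. The names `chunk_tokens` and `doc_scores_from_chunk_hits` are made up for illustration, and taking the best chunk score per document is just one simple aggregation choice, not a solution to the open problem.

```python
def chunk_tokens(tokens, max_tokens, overlap=0):
    """Split a token list into windows of at most max_tokens tokens.

    Consecutive windows share `overlap` tokens. Counting tokens rather than
    characters keeps each chunk within the encoder's input limit.
    """
    if max_tokens <= 0 or not 0 <= overlap < max_tokens:
        raise ValueError("need max_tokens > 0 and 0 <= overlap < max_tokens")
    step = max_tokens - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + max_tokens])
        if start + max_tokens >= len(tokens):
            break  # the last window already reaches the end of the document
    return chunks


def doc_scores_from_chunk_hits(chunk_hits):
    """Collapse (doc_id, chunk_score) hits into one score per document.

    Assumption: a document is scored by its best-matching chunk (max pooling).
    Sum or mean over chunks are equally plausible and rank differently.
    """
    best = {}
    for doc_id, score in chunk_hits:
        if doc_id not in best or score > best[doc_id]:
            best[doc_id] = score
    return sorted(best.items(), key=lambda item: item[1], reverse=True)


# Whitespace split is only a stand-in for the embedding model's tokenizer.
tokens = "a b c d e f g".split()
chunks = chunk_tokens(tokens, max_tokens=3, overlap=1)
ranked = doc_scores_from_chunk_hits([("d1", 0.2), ("d2", 0.9), ("d1", 0.7)])
```

Max pooling favors documents with one very relevant passage; summing would favor long documents with many mediocre hits, which is part of why the aggregation question stays open.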
This video suggests Elasticsearch is (or will be) able to blend the two: https://www.youtube.com/watch?v=5Qaxz2e2dVg
I'm hoping Meilisearch adds that because ES is a beast of a software package.
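For anyone wondering what blending the two could look like without a big search stack, here is a rough sketch of reciprocal rank fusion, one common way to merge ranked lists. This is not necessarily what the video describes. It assumes you already have a keyword ranking and an embedding ranking as lists of document ids, best first; the function name and the constant k=60 (a conventional default) are illustrative choices.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of doc ids into one list.

    Each document gets 1 / (k + rank) from every list it appears in, so only
    rank positions matter and the raw keyword and embedding scores never have
    to be put on a common scale.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)


keyword_hits = ["a", "b", "c"]    # hypothetical keyword (e.g. BM25) ranking
embedding_hits = ["b", "c", "d"]  # hypothetical embedding ranking
fused = reciprocal_rank_fusion([keyword_hits, embedding_hits])
```

Documents that show up in both lists float to the top, which is roughly the behavior you'd want from a ranking function that handles both kinds of search.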