|
|
|
|
|
by manishsharan
234 days ago
|
|
Thanks for sharing. TIL about rerankers. Chunking strategy is a big issue. I found acceptable results by shoving large texts to to gemini flash and have it summarize and extract chunks instead of whatever text splitter I tried. I use the method published by Anthropic https://www.anthropic.com/engineering/contextual-retrieval i.e. include full summary along with chunks for each embedding. I also created a tool to enable the LLM to do vector search on its own . I do not use Langchain or python.. I use Clojure+ LLMs' REST APIs. |
|
I've struggled to find a target market though. Would you mind sharing what your use case is? It would really help give me some direction.