Y
Hacker News
new
|
ask
|
show
|
jobs
by
visarga
576 days ago
You can achieve the same effect by using LLM to do question answering prior to embedding, it's much more flexible but slower, you can use CoT, or even graph rag. Late chunking is a faster implicit alternative.