


	by visarga 576 days ago
	You can achieve the same effect by having an LLM do question answering over the document prior to embedding. It's much more flexible but slower, and you can use chain-of-thought (CoT) or even GraphRAG there. Late chunking is a faster, implicit alternative.
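
	To make the contrast concrete, here is a minimal sketch of both routes, not anyone's actual implementation. `ask_llm` and `embed` are hypothetical callables you would supply yourself, not any library's API. `late_chunk_pool` shows only the pooling step of late chunking: it assumes the token vectors already come from a single pass of a long-context embedding model over the whole document, and it assumes mean pooling.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def enrich_then_embed(
    document: str,
    chunks: Sequence[str],
    ask_llm: Callable[[str], str],   # hypothetical: prompt -> answer text
    embed: Callable[[str], Vector],  # hypothetical: text -> vector
) -> List[Vector]:
    """Explicit route: one LLM call per chunk, then embed answer + chunk."""
    vectors = []
    for chunk in chunks:
        prompt = (
            "Using the full document, explain what this passage refers to.\n\n"
            f"Document:\n{document}\n\nPassage:\n{chunk}"
        )
        context = ask_llm(prompt)  # the slow but flexible step
        vectors.append(embed(f"{context}\n{chunk}"))
    return vectors


def late_chunk_pool(
    token_vectors: Sequence[Vector],
    spans: Sequence[Tuple[int, int]],
) -> List[Vector]:
    """Implicit route: mean-pool token vectors over each [start, end) span."""
    pooled = []
    for start, end in spans:
        window = token_vectors[start:end]
        if not window:
            raise ValueError(f"empty span: ({start}, {end})")
        dim = len(window[0])
        pooled.append(
            [sum(vec[i] for vec in window) / len(window) for i in range(dim)]
        )
    return pooled
```

	The first function costs one LLM call per chunk, which is where the extra latency comes from. The second needs no extra calls, because each chunk vector inherits document context from the token vectors themselves.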