| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mark_l_watson 638 days ago
	I just took the time to read through all source code and docs. Nice ideas. I like to experiment with LLMs running on my local computer so I will probably convert this example to use the light weight Python library Rank-BM25 instead of Elastic Search, and a long context model running on Ollama. I wouldn’t have prompt caching though. This example is well written and documented, easy to understand. Well done.