Hacker News
by
kbrisso
202 days ago
I built this for local RAG
https://github.com/kbrisso/byte-vision
It uses llama.cpp and Elasticsearch. On a laptop with an 8 GB GPU it can handle a 30K-token context and summarize a fairly large PDF.
1 comment
busssard
202 days ago
Elasticsearch is the true limitation of RAG systems...
kbrisso
202 days ago
The vector search works great once you figure it out. I wanted to focus on writing the application and not have to rewrite a document store.
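For anyone wondering what "figuring out" the vector search might involve, here is a minimal sketch of an approximate kNN search body for Elasticsearch 8.x. It is not taken from byte-vision; the field names `embedding` and `text` and the helper `build_knn_query` are hypothetical, and the index is assumed to have a `dense_vector` mapping on the embedding field.

```python
import json
from typing import Any


def build_knn_query(
    query_vector: list[float],
    field: str = "embedding",  # hypothetical dense_vector field name
    k: int = 5,
    num_candidates: int = 50,
) -> dict[str, Any]:
    """Build the JSON body for an Elasticsearch approximate kNN search.

    The body uses the top-level `knn` search option: `k` is the number of
    nearest neighbours to return, `num_candidates` is how many candidates
    each shard considers before the top k are picked.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    if num_candidates < k:
        raise ValueError("num_candidates must be >= k")
    return {
        "knn": {
            "field": field,
            "query_vector": query_vector,
            "k": k,
            "num_candidates": num_candidates,
        },
        # Return only the chunk text (hypothetical field) for the prompt.
        "_source": ["text"],
    }


if __name__ == "__main__":
    # A toy 3-dimensional vector; a real one would come from an embedding model.
    print(json.dumps(build_knn_query([0.1, 0.2, 0.3], k=2, num_candidates=10), indent=2))
```

The resulting dictionary would be sent as the body of a `_search` request against the index; the returned chunks can then be passed to the llama.cpp model as context.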