Hacker News
by akiselev 1133 days ago
You’re almost certainly going to have to write your own splitting code for anything nontrivial. LlamaIndex breaks down hard when there’s a lot of markup in the document, for example. You’ll also want control over the vector search strategy (just using the query or chunk embedding may not be enough).
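As a rough illustration of what "your own splitting code" could look like for markup-heavy input, here is a minimal sketch using only the Python standard library. It is not LlamaIndex's splitter or anyone's production approach. The names `BlockTextExtractor`, `split_markup`, `BLOCK_TAGS` and the `max_chars` parameter are all hypothetical, and it assumes the markup is HTML-like.

```python
import re
from html.parser import HTMLParser

# Tags treated as block boundaries (an illustrative assumption, not exhaustive).
BLOCK_TAGS = {"p", "div", "li", "h1", "h2", "h3", "h4", "pre", "tr", "br"}


class BlockTextExtractor(HTMLParser):
    """Collects visible text, starting a new block at each block-level tag."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._buf = []

    def _flush(self):
        # Collapse whitespace and store the buffered text as one block.
        text = re.sub(r"\s+", " ", "".join(self._buf)).strip()
        if text:
            self.blocks.append(text)
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self._flush()

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self._flush()

    def handle_data(self, data):
        self._buf.append(data)

    def close(self):
        super().close()
        self._flush()  # keep any trailing text outside a block tag


def split_markup(html, max_chars=200):
    """Strip tags, then greedily pack whole blocks into chunks.

    Chunks stay within max_chars where possible; a single block longer
    than max_chars is kept whole rather than cut mid-sentence.
    """
    parser = BlockTextExtractor()
    parser.feed(html)
    parser.close()

    chunks, current = [], ""
    for block in parser.blocks:
        candidate = f"{current} {block}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = block
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    doc = "<h1>Title</h1><p>First <b>bold</b> paragraph.</p><p>Second paragraph.</p>"
    for chunk in split_markup(doc, max_chars=30):
        print(repr(chunk))
```

Splitting on block boundaries keeps inline tags like `<b>` from fragmenting a sentence, which is the kind of control a generic splitter may not give you. Embedding the chunks and choosing a search strategy are left out of this sketch.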
1 comment

In terms of search store and engine, would you agree that pgvector is sufficient for most text-specific cases?