Hacker News new | ask | show | jobs
Hybrid search vs. Token pooling benchmarking (blog.colivara.com)
3 points by jonathan-adly 555 days ago
1 comments

Hi HN!

We benchmarked two ways to improve the latency in RAG workflows with a multi-vector setup. Hybrid search using Postgres native capabilities and a relatively new method of token pooling. Token pooling unlocked up to 70% faster latency with <1% performance cost.