Hacker News new | ask | show | jobs
by tk90 760 days ago
Isn't there retrieval degradation with such a large context size? I would still think that a RAG system on 128K is still better than No Rag + 1M context window, no? (assuming text only)
2 comments

Absolutely. Gemini results tend to drop off after 128k tokens according to RULER: https://github.com/hsiehjackson/RULER
If I'm reading the paper right, it says Gemini 1.5 is good up to 128k, and RULER doesn't test beyond 128k
Hmm perhaps you are right.. There's no context on the page about what ">128k" means so possibly a bad assumption on my part!
Not sure why you've been downvoted. Needle in a haystack testing exists for a reason