Hacker News
by
tk90
760 days ago
Isn't there retrieval degradation with such a large context size? I would think that a RAG system on a 128K context is still better than no RAG + a 1M context window, no? (assuming text only)
2 comments
cathalc
760 days ago
Absolutely. Gemini results tend to drop off after 128k tokens according to RULER:
https://github.com/hsiehjackson/RULER
ayyndrew
759 days ago
If I'm reading the paper right, it says Gemini 1.5 is good up to 128k, and RULER doesn't test beyond 128k
cathalc
759 days ago
Hmm, perhaps you are right. There's no context on the page about what ">128k" means, so possibly a bad assumption on my part!
afro88
760 days ago
Not sure why you've been downvoted. Needle-in-a-haystack testing exists for a reason.
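For anyone unfamiliar with the needle-in-a-haystack idea mentioned above, here is a rough sketch of how such a sweep could look. Everything in it is an assumption for illustration: `build_haystack`, `toy_model`, and `run_sweep` are hypothetical names, and `toy_model` is a stand-in that only "sees" the tail of its input, not a real LLM or any benchmark's actual harness. It is not how RULER works internally.

```python
NEEDLE = "The secret number is 7421."
ANSWER = "7421"
FILLER = "The sky was grey that morning."
QUESTION = "What is the secret number?"


def build_haystack(n_sentences, depth):
    """Return n_sentences filler sentences with the needle inserted
    at a relative depth in [0, 1] (0 = start, 1 = end)."""
    sentences = [FILLER] * n_sentences
    sentences.insert(round(depth * n_sentences), NEEDLE)
    return " ".join(sentences)


def toy_model(context, question, max_chars=2000):
    """Hypothetical stand-in for a model call. It ignores the question
    and can only read the last max_chars characters of the context,
    which crudely imitates retrieval falling off with length."""
    visible = context[-max_chars:]
    marker = "The secret number is "
    idx = visible.find(marker)
    if idx == -1:
        return ""
    # Take everything after the marker up to the sentence-ending period.
    return visible[idx + len(marker):].split(".")[0]


def run_sweep(model, lengths, depths):
    """Ask the model for the needle at every (length, depth) pair
    and record whether the answer came back correctly."""
    results = {}
    for n in lengths:
        for depth in depths:
            reply = model(build_haystack(n, depth), QUESTION)
            results[(n, depth)] = ANSWER in reply
    return results


if __name__ == "__main__":
    sweep = run_sweep(toy_model, lengths=[10, 1000], depths=[0.0, 0.5, 1.0])
    for (n, depth), ok in sorted(sweep.items()):
        print(f"sentences={n:5d} depth={depth:.1f} found={ok}")
```

To test a real model you would swap `toy_model` for an actual API call and measure lengths in tokens rather than sentences. The grid of pass/fail over length and depth is what tells you where retrieval starts to degrade.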