|
|
|
|
|
by smokel
395 days ago
|
|
> Gemini's 1M token context window is really unbeatable. How does that work in practice? Swallowing a full 1M context window would take in the order of minutes, no? Is it possible to do this for, say, an entire codebase and then cache the results? |
|
Caching a code base is tricky, because whenever you modify the code base, you're invalidating parts of the cache and due to conditional probability any changed tokens will change the results.