|
|
|
|
|
by anotherpaulg
505 days ago
|
|
This is a harsh foot-gun that seems to harm many ollama users. That 2k default is extremely low, and ollama *silently* discards the leading context. So users have no idea that most of their data hasn’t been provided to the model. I’ve had to add docs [0] to aider about this, and aider overrides the default to at least 8k tokens.
I’d like to do more, but unilaterally raising the context window size has performance implications for users. Edit: Ok, aider now gives ollama users a clear warning when their chat context exceeds their ollama context window [1]. [0] https://aider.chat/docs/llms/ollama.html#setting-the-context... [1] https://github.com/Aider-AI/aider/blob/main/aider/coders/bas... |
|
Fortunately it's easy to create a variant of the model with increased context size using the CLI[3] and then use that variant instead.
Just be mindful that longer context means more memory required[4].
[1]: https://github.com/ollama/ollama/issues/4967
[2]: https://github.com/ollama/ollama/issues/7043
[3]: https://github.com/ollama/ollama/issues/8099#issuecomment-25...
[4]: https://www.reddit.com/r/LocalLLaMA/comments/1848puo/comment...