|
|
|
|
|
by ericb
396 days ago
|
|
In ollama, how do you set up the larger context, and figure out what settings to use? I've yet to find a good guide. I'm also not quite sure how I should figure out what those settings should be for each model. There's context length, but then, how does that relate to input length and output length? Should I just make the numbers match? 32k is 32k? Any pointers? |
|
Just for ollama, see: https://github.com/ollama/ollama/blob/main/docs/faq.md#how-c...
I’m using llama.cpp though, so I can’t confirm these methods.