Hacker News new | ask | show | jobs
by freehorse 472 days ago
If you did not change the context length, it is certain that it is not 2k or so. In "/show info" there is a field "context length" which is about the model in general, while "num_ctx" under "parameters" is the context length for the specific chat.

I use modelfiles because I only use ollama because it has easy integration with other stuff eg with zed, so this way I can easily choose models with a set context size directly.

Here nothing fancy, just

    FROM qwq
    PARAMETER num_ctx 100000
You save this somewhere as a text file, you run

    ollama create qwq-100k -f path/to/that/modelfile
and you now have "qwq-100k" in your list of models.