|
|
|
|
|
by jmorgan
741 days ago
|
|
Shoot sorry about that. There's a few ways to keep the model loaded in memory: 1. If using `ollama run`: `ollama run llama3 --keepalive -1` 2. If running ollama serve directly, use `OLLAMA_KEEP_ALIVE=-1` ollama serve 3. If using the api, there's a `keep_alive` parameter you can set to -1 |
|