|
|
|
|
|
by zackify
394 days ago
|
|
Ollama breaks for me. If I manually set the context higher. The next api call from clone resets it back. And ollama keeps taking it out of memory every 4 minutes. LM studio with MLX on Mac is performing perfectly and I can keep it in my ram indefinitely. Ollama keep alive is broken as a new rest api call resets it after. I’m surprised it’s this glitched with longer running calls and custom context length. |
|