|
|
|
|
|
by Y_Y
508 days ago
|
|
So it's a replacement for Ollama? The killer features of Ollama for me right now are the nice library of quantized models and the ability to automatically start and stop serving models in response to incoming requests and timeouts. The first send to be solved by reusing the Ollama models, but I can't see if the service is possible from my cursory look. |
|