Hacker News
by majorchord 66 days ago
> the ability to "hotswap" models with different utility instead of restarting the server

The article mentions that llama-swap does this.