Hacker News new | ask | show | jobs
by XCSme 267 days ago
PS: I thought Ollama had a way to use RAM instead of VRAM (?) to keep the model active when not in use, but in my experience that didn't solve the problem.