by gavmor 525 days ago
Yes, Ollama automatically determines the number of layers to offload to the GPU based on available VRAM.
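
To make the idea concrete, here is a rough sketch of how that kind of decision could work. This is not Ollama's actual code: the function name, the parameters, and the assumption that every layer is the same size are all made up for illustration, and the real logic likely accounts for more (KV cache, context size, multiple GPUs).

```python
def layers_to_offload(free_vram_bytes, bytes_per_layer, total_layers, reserved_bytes=0):
    """Estimate how many model layers fit in GPU memory.

    Hypothetical heuristic, NOT Ollama's real algorithm:
    assumes every layer costs the same and that a fixed
    amount of VRAM (reserved_bytes) must stay free for overhead.
    """
    if bytes_per_layer <= 0:
        raise ValueError("bytes_per_layer must be positive")
    # VRAM left after setting aside the reserved overhead (never negative).
    usable = max(free_vram_bytes - reserved_bytes, 0)
    # Whole layers that fit, capped at the number of layers in the model.
    return min(total_layers, usable // bytes_per_layer)


MiB = 1024 ** 2
GiB = 1024 ** 3

# A 32-layer model at ~200 MiB per layer, with 1 GiB held back for overhead.
print(layers_to_offload(8 * GiB, 200 * MiB, 32, reserved_bytes=1 * GiB))
print(layers_to_offload(4 * GiB, 200 * MiB, 32, reserved_bytes=1 * GiB))
```

With plenty of VRAM the whole model goes to the GPU; with less, only some layers are offloaded and the rest run on the CPU.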