|
|
|
|
|
by wing-_-nuts
755 days ago
|
|
The general rule is that VRAM == parameter count in billions (I'm generalizing gguf finetunes here) 8GB vram cards can run 7B models 16GB vram cards can run 13B models 24GB vram cards can run up to 33B models Now to your question, what can most computers run? You need to look at the tiny but specialized models. I would think 3B models could be ran reasonably well even on the CPU. Intellij has a absolutely microscopic < 1B model that it uses for code completion locally. It's quite good and I don't notice any delay. |
|