by djhworld
368 days ago
With system builds like this, I always feel VRAM is the limiting factor for which models you can run, and consumer-grade cards tend to max out at 16GB or (sometimes) 24GB on the more expensive cards. It does make me wonder whether we'll start to see more and more computers with a unified memory architecture (like the Mac). I know NVIDIA have the Digits thing, which has been renamed to something else.

So there’s a fundamental tradeoff between cost, inference speed, and hostable model size for the foreseeable future.
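
To put rough numbers on the "hostable model size" part, here is a back-of-the-envelope sketch in Python. It only counts the bytes needed for the weights and adds a flat allowance for everything else (KV cache, activations, runtime buffers). The function names and the 20% overhead figure are assumptions made up for illustration, not measurements; real usage depends on context length, batch size, and the inference runtime.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: int) -> float:
    """Approximate memory for the model weights alone, in GB (10^9 bytes)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def fits_in_vram(params_billion: float, bits_per_weight: int,
                 vram_gb: float, overhead_fraction: float = 0.2) -> bool:
    """Rough check: do the weights plus a flat overhead fit in the given VRAM?

    overhead_fraction is an assumed allowance for KV cache, activations and
    runtime buffers; it is a placeholder, not a measured value.
    """
    needed_gb = weight_memory_gb(params_billion, bits_per_weight) * (1 + overhead_fraction)
    return needed_gb <= vram_gb


if __name__ == "__main__":
    # Compare a few model sizes and quantization levels against common card sizes.
    for params, bits in [(7, 16), (7, 4), (70, 4)]:
        for vram in (16, 24):
            verdict = "fits" if fits_in_vram(params, bits, vram) else "does not fit"
            print(f"{params}B model at {bits}-bit on a {vram}GB card: {verdict}")
```

Under these assumptions, quantization is what lets mid-sized models onto 16GB and 24GB cards, while much larger models push you toward multiple GPUs or a unified-memory machine, which is where the cost and speed side of the tradeoff comes in.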