by zyx321
314 days ago
It's not a fixed split. I don't know whether it can be changed live or requires a reboot, but it's not hardwired. What I want to know is whether this is possible: 4GB for Linux, a bit of room for the calculations, and then a 122GB model loaded entirely into VRAM. How would that perform in real life? Someone please benchmark it!

I have that split set at the minimum 2 GB and I'm giving the GPU a 20 GB model to process.