by maiybe
527 days ago
Under the hood, we support multiple selectable models, but we haven't optimized all the possible quantizations yet (the space is moving fast). The memory footprint ranges from 1GB to 24GB depending on model selection, and it would be amazing to push lower than that. 24GB is the high end, since only the NVIDIA XX90s can support it.