|
|
|
|
|
by FrostKiwi
1163 days ago
|
|
Fascinating progress. Would you say the following understanding is correct?: - You can fine-tune a model, regardless of whether it has been quantized (as in the 4-bit versions of models made to fit in consumer grade RAM sizes) or not. - You can fine-tune any model on any hardware, provided it fits into RAM. That means, that the 30B llama-derived models in their 4-bit quantized version and 19.5GB of VRAM requirement can be fine-tuned on consumer grade GPUs with 24gb of VRAM. (Like the RTX 3090 and 4090) |
|
To the second, I'm not sure that the RAM requirements are the same to train because you have to preserve the state which takes extra memory.