|
|
|
|
|
by Daegalus
1158 days ago
|
|
Im not an expert, and I don't have nvidia, but I assume you need to setup CUDA and install the CUDA pytorch stuff? Most docs Ive read on setting up finetuners and inference require some extra stuff. Taking some LORA fine tuners, they include instructions like this: conda create -n llm-finetuner python=3.10
conda activate llm-finetuner
conda install -y cuda -c nvidia/label/cuda-11.7.0
conda install -y pytorch=2 pytorch-cuda=11.7 -c pytorch
When I experimented with Stable Diffusion and ROCM (amd card), i had to do similar but with pythorch-rocm. and when I was doing a CPU only, did `pytorch-cpu`. So maybe your attempt didn't use the GPUs at all, because 12 mins is about what I had on a CPU for inference on other models of similar size. |
|