|
|
|
|
|
by Ms-J
1044 days ago
|
|
I very much do appreciate your comment and will look into into llama.cpp.
Was it from here: https://github.com/ggerganov/llama.cpp Do you have a guide that you followed and could link it to me or was it just from prior knowledge? Also, do you know if I could run the Wizard Vicuna on it? That model isn't listed on the above page. |
|
https://replicate.com/blog/run-llama-locally
I found that guide here on hn.
I run it cpu only with 16 threads but yeah perf is good enough.
BTw my 6gb figure is me.measuring from htop so llama2 is likely less.