Hacker News
by
bestcoder69
1188 days ago
Does anyone know of a 65B instruct model I can run on llama.cpp? Or whether it’s possible to train one on a 64GB Mac Studio? I haven’t come across either yet.
2 comments
sahil_chaudhary
1188 days ago
I'm training a 65B model right now. Also, I believe you can use lora-alpaca to train on this data on a much smaller machine.
cfn
1188 days ago
Llama has a 65B model, and it runs (albeit slowly) on a 64GB Mac Studio with llama.cpp.
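For a rough sense of why that fits, here is a back-of-the-envelope estimate of weight memory at a few precisions. This is only a sketch: the 65e9 parameter count is nominal, the bit widths are assumptions (the comment doesn't say which quantization is in use), and the helper name `weight_memory_gb` is made up for illustration. It ignores the KV cache, activations, and any per-block quantization overhead, so real usage will be somewhat higher.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GB (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Nominal parameter count for a "65B" model (assumption: exactly 65e9).
N_PARAMS = 65e9

# Assumed precisions; the thread does not state which one is being used.
for label, bits in [("fp16", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_memory_gb(N_PARAMS, bits):.1f} GB")
```

By this estimate only the 4-bit case (about 32.5 GB of weights) leaves real headroom on a 64GB machine, which would be consistent with it running, but slowly.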
bestcoder69
1188 days ago
Yeah, I have that running. I'm looking specifically for a 65B instruction-tuned model. I just found a way to get Alpaca 30B going, though.