Hacker News new | ask | show | jobs
by bestcoder69 1188 days ago
Anyone know of a 65B instruct model I can run on llama.cpp? Or if it’s possible to train one on a 64gb Mac Studio? Haven’t come across either yet.
2 comments

I'm training a 65B model right now, also I believe you can use lora-alpaca to train on this data on a much smaller machine.
Llama has a 65B model and it runs (albeit slowly) on a 64gb Mac Studio with llama.cpp.
Yea I have that running. Looking specifically for a 65B instruction-tuned model. I just found a way to get alpaca 30B going though.