Hacker News new | ask | show | jobs
by sahil_chaudhary 1188 days ago
I'm training a 65B model right now, also I believe you can use lora-alpaca to train on this data on a much smaller machine.