Hacker News new | ask | show | jobs
by warkdarrior 655 days ago
> 3B parameter model

That's tiny. Can it train/fine-tune 70B models?