Hacker News
by warkdarrior 655 days ago
> 3B parameter model
That's tiny. Can it train/fine-tune 70B models?