Hacker News
by
visarga
1170 days ago
Needs a big honking datacenter or billions of compute credits and safety for 6-12 months.
tanseydavid
1170 days ago
Doesn't Alpaca seem to suggest that assumption is no longer true?
visarga
1169 days ago
Alpaca rides on LLaMA, and LLaMA was pre-trained on 1T tokens over a long time. The fine-tuning takes one hour with low-rank adaptation (LoRA), but pre-training a new model takes months.
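To give a rough feel for why the low-rank fine-tuning step is so much cheaper than pre-training, here is a back-of-the-envelope sketch. It only counts trainable parameters for a single weight matrix; the hidden size of 4096 and rank of 8 are illustrative assumptions, not figures from Alpaca or LLaMA, and the helper names are made up for this example.

```python
def full_params(d_out: int, d_in: int) -> int:
    """Trainable parameters when the whole d_out x d_in weight matrix is updated."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Trainable parameters when only a low-rank update B @ A is learned.

    B has shape d_out x rank and A has shape rank x d_in;
    the original weight matrix stays frozen.
    """
    return rank * (d_out + d_in)


d = 4096  # assumed hidden size, for illustration only
r = 8     # assumed adapter rank, for illustration only

full = full_params(d, d)
lora = lora_params(d, d, r)

print(full, lora, full // lora)  # 16777216 65536 256
```

Under these assumptions the adapter trains 256 times fewer parameters for that one matrix. That only covers the fine-tuning side; it says nothing about the cost of producing the frozen base weights in the first place, which is the pre-training the comment is pointing at.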