by Deverauxi 842 days ago

Well.

Last time round we got our hands on 7B-70B models.

Presumably, a company with as much compute as Meta can train even larger models that weren't released, and those would benefit greatly from the massive global effort expended for free on the smaller open-source LLM family tree. And since their architecture ends up largely adopted by the open-source community, that community helps build the advanced tools needed to use those models.