Hacker News
by rahimnathwani, 1095 days ago
AIUI it uses the Llama architecture, but not Facebook's Llama weights. It uses MPT-7B, which was trained from scratch:
https://www.mosaicml.com/blog/mpt-7b