Hacker News new | ask | show | jobs
by rahimnathwani 1095 days ago
AIUI it uses the Llama architecture, but not Facebook's Llama weights. It uses MPT-7B, which was trained from scratch: https://www.mosaicml.com/blog/mpt-7b