Hacker News new | ask | show | jobs
by imjonse 844 days ago
Most (all?) open-ish 7B+ models today are finetunes of proprietary/semi-closed/bigbudget LLMs. There is no such foundation model for Mamba yet.