Hacker News new | ask | show | jobs
by sanjiwatsuki 803 days ago
The current best open weights model is probably Cohere Command-R+. The memory requirements on it are quite high, though.
1 comments

I really want to see some benchmarks with performance weighted by energy use. I think Mistral 7B performance to watt would be the leader by a huge margin. On many tasks I get equal performance on zero shot classification tasks on Mistral than in bigger models.