by segmondy 806 days ago
With open models, yes, we are at least at the performance of the first release of ChatGPT 4.

Could you recommend one or a few in particular?
The current best open weights model is probably Cohere Command-R+. The memory requirements on it are quite high, though.
I really want to see some benchmarks with performance weighted by energy use. I think Mistral 7B would lead on performance per watt by a huge margin. On many zero-shot classification tasks, I get the same performance from Mistral 7B as from bigger models.
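To make "performance weighted by energy use" concrete, here is a rough sketch of one possible metric: correct answers per watt-hour. Everything in it is an assumption for illustration. The `Run` fields, the function names, and the choice of metric are hypothetical, and the numbers in the usage lines are placeholders, not measurements of any real model.

```python
from dataclasses import dataclass


@dataclass
class Run:
    """One benchmark run of one model (all fields are hypothetical)."""
    name: str
    accuracy: float         # fraction of examples answered correctly, 0..1
    avg_power_watts: float  # mean power draw measured during the run
    seconds: float          # wall-clock duration of the run
    n_examples: int         # number of examples evaluated


def energy_wh(run: Run) -> float:
    """Energy used by the run in watt-hours (watts * hours)."""
    return run.avg_power_watts * run.seconds / 3600.0


def correct_per_wh(run: Run) -> float:
    """Correct answers produced per watt-hour of energy."""
    return run.accuracy * run.n_examples / energy_wh(run)


def rank_by_efficiency(runs: list[Run]) -> list[Run]:
    """Sort runs from most to least energy-efficient."""
    return sorted(runs, key=correct_per_wh, reverse=True)


if __name__ == "__main__":
    # Placeholder values only; substitute your own measurements.
    runs = [
        Run("small-model", accuracy=0.80, avg_power_watts=100.0,
            seconds=3600.0, n_examples=1000),
        Run("large-model", accuracy=0.85, avg_power_watts=700.0,
            seconds=3600.0, n_examples=1000),
    ]
    for run in rank_by_efficiency(runs):
        print(f"{run.name}: {correct_per_wh(run):.2f} correct answers per Wh")
```

A metric like this rewards a small model that matches a larger one on accuracy, since the denominator grows with power draw and run time while the numerator stays flat. How power is actually measured (GPU only, whole machine, idle draw subtracted or not) would change the ranking, so that would need to be stated alongside any results.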