Hacker News new | ask | show | jobs
by miven 916 days ago
There seems to be an experimental Mistral Medium model listed among other available model endpoints on [1], the comparison table they give shows that it outmatches 8x7B by a few percent on every benchmark listed

[1] https://mistral.ai/news/la-plateforme/

1 comments

It apparently outperforms GPT-4 at WinoGrande as well…