Hacker News new | ask | show | jobs
by blurpesec 916 days ago
Mistral-7B is surprisingly decent for general purpose small tasks. The more complex the task or the more specific the knowledge recall, the worse the performance since the smaller the models are - the less breadth they tend to have.

But they're very nice for making PoCs on complex systems since they're near free to run.

2 comments

There's even a new version of the Mistral 7b out there today that should be a lot better, v 0.2.

The finetunes of 0.1 are already extremely impressive at general tasks.

https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GGU...

Have you tried the 0.2 version they released yesterday? Curious if you’re seeing significant improvements