Hacker News new | ask | show | jobs
by paolop 926 days ago
What's your experience with open source LLMs like LLaVA 1.5 or Mistral 7B?
2 comments

The cognitivecollective fine-tune for Mistral-7B is by far the best small model I've found. https://huggingface.co/TheBloke/CollectiveCognition-v1.1-Mis...

The llava multi-modal models are fun. I find requesting json formatted output lets you overcome the limited response length baked in. https://huggingface.co/mys/ggml_bakllava-1 (a CLIP+Mistral-7B instead of CLIP+llama2-7B) is my favorite.

The fine-tunes of Mistal 7B, open-Hermes-2.5 and OpenOrca are good. Zephyr is underwhelming.