| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ramesh31 980 days ago
	Is it better than llama 2?

3 comments

tarruda 980 days ago

It is better than llama 2 7b and 13b. I tried the OpenOrca fine tune and it is very good, even when 4-bit quantized

link

faizshah 979 days ago

What does OpenOrca do? It’s just instruction tuning it?

link

tarruda 979 days ago

Yes, it is a instruction tune dataset: https://huggingface.co/datasets/Open-Orca/OpenOrca

It felt different from the official Mistral7B-Instruct. One of the highlights with the OpenOrca version is that you can steer the model with a system prompt (eg "You are a 5 year old")

link

sebzim4500 980 days ago

For its size, yes. In absolute terms it is obviously less capable than llama-2-70B

link

espadrine 979 days ago

For now. Huggingface[0] mentioned a DPO-fine-tuned version, Zephyr 7B, which it claims is competitive with Llama2-70B[1].

[0]: https://huggingface.co/spaces/HuggingFaceH4/zephyr-chat

[1]: https://twitter.com/huggingface/status/1711780979574976661

link

brucethemoose2 979 days ago

Oh, they uploaded the weights. I missed this one, cheers!

link

andai 979 days ago

I found llama-2-70B to be a bit worse than GPT-4. (So, pretty good!) But I did not compare with GPT-3.

How do llama-2-70B and Mistral 7B compare with GPT-3?

link

TheRoque 980 days ago

Yes

link