| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by paulcjh 1054 days ago
	Any performance benchmarks compared to other LLMs? Also, any performance increases on the orig Falcon model in inference speed? We ditched most of our focus on Falcon 40B after Llama 2 70B came out, both the tokens per sec and quality of results are not even close.

2 comments

brianjking 1054 days ago

I'm assuming this will have similar scores to the original 40B model, in which case LLaMa2 70b would outperform it. The avg score on the Open LLM Leaderboard of LLaMa2 70b instruct is 72.3.

Falcon-40B is 63.4 or 61.5 on the non instruction tuned version.

link

Tepix 1054 days ago

Did you fine-tune your own Llama 2? Llama2-chat is awfully gimped.

link