| HN Mirror



	by Alex-Programs 471 days ago
	This is ridiculous. 32B and beating DeepSeek and o1. And yet I'm trying it out and, yeah, it seems pretty intelligent... Remember when models this size could just about maintain a conversation?

2 comments

moffkalast 471 days ago

I still remember Vicuna-33B, that one stayed on the leaderboards for quite a while. Today it looks like a Model T, with 1B models being more coherent.


dcreater 470 days ago

Have you tried it yet? Don't fall for benchmark scores.
