Y
Hacker News
new
|
ask
|
show
|
jobs
by
Alex-Programs
471 days ago
This is ridiculous. 32B and beating deepseek and o1. And yet I'm trying it out and, yeah, it seems pretty intelligent...
Remember when models this size could just about maintain a conversation?
2 comments
moffkalast
471 days ago
I still remember Vicuna-33B, that one stayed on the leaderboards for quite a while. Today it looks like a Model T, with 1B models being more coherent.
link
dcreater
470 days ago
Have you tried it as yet? Don't fall for benchmark scores.
link