Hacker News new | ask | show | jobs
by Alex-Programs 471 days ago
This is ridiculous. 32B and beating deepseek and o1. And yet I'm trying it out and, yeah, it seems pretty intelligent...

Remember when models this size could just about maintain a conversation?

2 comments

I still remember Vicuna-33B, that one stayed on the leaderboards for quite a while. Today it looks like a Model T, with 1B models being more coherent.
Have you tried it as yet? Don't fall for benchmark scores.