by freehorse 469 days ago
How does it compare to qwen32b-r1-distill, which is probably the most directly comparable model?

I'm wondering as well. The Open LLM Leaderboard only lists the preview. It ranks better than deepseek-ai/DeepSeek-R1-Distill-Qwen-32B but, surprisingly, worse than deepseek-ai/DeepSeek-R1-Distill-Qwen-14B.

On the Open LLM Leaderboard overall, this model is ranked quite low, at 660: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_...