| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by m11a 314 days ago

I tried these models half-sceptically.

I ended up blown away. via Cerebras/Groq, you're looking at around 1000 tok/sec for the 120B model. For gentic code generation, I found the abilities to exceed gpt-4.1. Tool calling was surprisingly good, albeit not as good as Qwen3 Coder for me.

It's a very capable model, and a very good release. The high throughput is a game changer.