by m11a
314 days ago
I tried these models half-sceptically and ended up blown away. Via Cerebras/Groq, you're looking at around 1000 tok/sec for the 120B model. For agentic code generation, I found its abilities exceed gpt-4.1. Tool calling was surprisingly good, albeit not as good as Qwen3 Coder for me. It's a very capable model, and a very good release. The high throughput is a game changer.