by brunooliv 32 days ago
Any reason why they indexed on the Kimi K2.5 model? I have tried many open-source ones in Opencode, and in my experience (standard backend development: Java, Python, Spring, etc.) Qwen3.6 is SO MUCH BETTER that it's shocking. Kimi can't even get most tool-calling arguments right.
3 comments

There's a lead time on models, and there are some tuning gotchas they probably already figured out with Kimi, so they weren't ready to just drop everything and switch. I'm sure they will switch models eventually.
I recommend reading the entire article:

  Together with SpaceXAI, we're training a significantly larger model from scratch, using 10x more total compute.
  With Colossus 2's million H100-equivalents and our combined data and training techniques, we expect this to be a major leap in model capability.
I guess the success of the new coding model will largely decide whether xAI is going to pay 60 or 10 billion.
Kimi K2.5 has the best long context. For raw coding benchmark scores, you can just post-train on top of it with more specialized data. K2.5 is kinda old; K2.6 is the current release, which is exactly that, and it catches up to the frontier in most aspects.
Cheaper to run?