| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by brunooliv 32 days ago
	Any reason why they indexed on Kimi K2.5 model? I have tried many open-source ones in Opencode, and, in my experience (standard backend development, Java, Python, Spring, etc) Qwen3.6 is SO MUCH BETTER that's shocking. Kimi can't even get most tool calling arguments right.

3 comments

CuriouslyC 32 days ago

There's a lead time on models, and there's some tuning gotchas they probably already figured out with Kimi, so they weren't ready to just drop everything and switch. I'm sure they will switch models eventually.

link

roflcopter69 32 days ago

I recommend reading the entire article

  Together with SpaceXAI, we're training a significantly larger model from scratch, using 10x more total compute.
  With Colossus 2's million H100-equivalents and our combined data and training techniques, we expect this to be a major leap in model capability.

link

grim_io 32 days ago

I guess this will largely decide if xai is going to pay 60 or 10 billion, depending on the success of the new coding model.

link

KaoruAoiShiho 32 days ago

Kimi 2.5 has the best long context. For raw coding benchmark scores you can just post train on top of it with more specialized data. 2.5 is kinda old, 2.6 is the current release which is exactly just that and catches up to the frontier in most aspects.

link

Bombthecat 32 days ago

Cheaper to run?

link