Hacker News
by piotraleksander 59 days ago
It's such a misinformed statement, as Kimi 2.5 was used as the base model for Composer 2 and then heavily RLed.
1 comment

What does “heavy RL” even mean? It’s similar to how the CEO of Cursor talked about how much the perplexity improved, when perplexity is a terrible metric for the performance of a fine-tuned model. Let’s be real here: it’s Kimi 2.5 fine-tuned for Cursor. There’s nothing wrong with that, but they tried to hide it. They put in some work, but nothing close to training a model of their own.