Hacker News new | ask | show | jobs
by simplyluke 87 days ago
Any investor who believed a team their size and with their capital was training a SOTA base model doesn't understand the space. I fully believe that was some of their investors, but people acting like RL + fine tuning based on their massive user base that's producing qualitatively better outputs than the base model is meaningless aren't understanding what the company is doing.
1 comments

Could you explain how much improvement RL+fine tuning can give with respect to Composer 2.0 model over Kimi K2.5? I don't fully grasp the work Cursor model has done here.