| HN Mirror



	by simplyluke 87 days ago
	Any investor who believed a team of their size, with their capital, was training a SOTA base model doesn't understand the space. I fully believe that describes some of their investors. But people who act as if RL + fine-tuning on their massive user base is meaningless, when it's producing qualitatively better outputs than the base model, don't understand what the company is doing.

1 comment

ajitid 81 days ago

Could you explain how much improvement RL + fine-tuning can give the Composer 2.0 model over Kimi K2.5? I don't fully grasp the work Cursor has done on the model here.
