| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jjcm 57 days ago
	This is probably less likely with this model, as it’s almost certainly a further RL training continuation of 3.5 27b. The bugs with this architecture were worked out when that dropped.

1 comments

Valuable note!