| HN Mirror


	by valine 625 days ago
	Not really. The first paper is just fine-tuning on synthetic data. The second paper doesn’t optimize the model weights.