| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by johnsutor 625 days ago
	Seems like this is already being answered: https://arxiv.org/abs/2407.10930 https://arxiv.org/abs/2006.04439

1 comments

Not really the first paper is just fine-tuning on synthetic data. The second paper doesn’t optimize the model weights.