Hacker News new | ask | show | jobs
by johnsutor 625 days ago
Seems like this is already being answered:

https://arxiv.org/abs/2407.10930 https://arxiv.org/abs/2006.04439

1 comments

Not really the first paper is just fine-tuning on synthetic data. The second paper doesn’t optimize the model weights.