Hacker News
by
johnsutor
625 days ago
Seems like this is already being answered:
https://arxiv.org/abs/2407.10930
https://arxiv.org/abs/2006.04439
valine
625 days ago
Not really. The first paper is just fine-tuning on synthetic data, and the second paper doesn’t optimize the model weights.