by Aurornis
8 days ago
> 2. The improvement would come from merging the weights PLUS on-policy distillation.

The confusion is that the uploaded model didn't have the distillation at all. They merged the base model with another lab’s fine-tuned model. The improvements could have come from picking up some of the fine-tuned weights from that other model. If they really had a better-performing model that they “accidentally” forgot to upload, they could have uploaded the correct file by now.
https://news.ycombinator.com/item?id=48529544