by sudosysgen
510 days ago
Again, this isn't how distillation works. Your task as the distillation model is to copy mistakes, and you will be penalized for pruning, reconciling, and generating. "Play and reflection" is something else, which isn't distillation.
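
To make the point concrete, here is a rough sketch of a typical distillation objective, assuming the common formulation where the student minimizes the KL divergence from the teacher's output distribution. The function names, logits, and three-class setup are made up for illustration and are not taken from any particular system. The loss never sees the true label, so a student that reproduces the teacher's mistake scores better than one that corrects it.

```python
import math

def softmax(logits, temperature=1.0):
    # Numerically stable softmax over a list of logits.
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=1.0):
    # KL(teacher || student): zero only when the student matches the teacher.
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical 3-class example: suppose the true label is class 0,
# but the teacher confidently (and wrongly) prefers class 2.
teacher = [0.5, 0.0, 3.0]
copycat = [0.5, 0.0, 3.0]    # reproduces the teacher's mistake
corrected = [3.0, 0.0, 0.5]  # prefers the true label instead

loss_copycat = distillation_loss(teacher, copycat)
loss_corrected = distillation_loss(teacher, corrected)
print(loss_copycat, loss_corrected)
```

The "corrected" student ends up with the larger loss, which is the sense in which the student is penalized for deviating from the teacher, mistakes included.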