|
|
|
|
|
by stingraycharles
39 days ago
|
|
I don’t think what you’re saying makes a lot of sense. You don’t “fill the KV cache with whatever alignment you want.” That doesn’t exist. The KV cache is an inference optimization, and is populated by running tokens through the model. Synthetic data is generated by other models, and yes this is often where identity propagates. I think with the snowballing you mean things like iterative self distillation? That’s definitely not done unsupervised, because of the risk of model collapse, and typically heavily curated and/or mixed with real data. |
|