by lyu07282
369 days ago
What I was referring to is similar in concept, but I've seen both described in papers as distillation. What I meant was that you take the output of a large model like GPT-4 and use it as training data to fine-tune a smaller model.
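To make that concrete, here is a rough sketch of the data-collection half, assuming the teacher is reachable through some text-in, text-out call. `teacher_generate` is a hypothetical stub standing in for the large model, not a real API, and the `prompt`/`completion` field names are just an illustrative choice.

```python
import json


def teacher_generate(prompt: str) -> str:
    # Hypothetical stand-in for a call to a large model such as GPT-4.
    # A real setup would replace this with an actual model or API call.
    return f"(teacher answer to: {prompt})"


def build_distillation_set(prompts):
    # Pair each prompt with the teacher's output. These pairs are the
    # supervised examples the smaller model would be fine-tuned on.
    return [{"prompt": p, "completion": teacher_generate(p)} for p in prompts]


def to_jsonl(records):
    # One JSON object per line, a common layout for fine-tuning data.
    return "\n".join(json.dumps(r) for r in records)


records = build_distillation_set(["What is distillation?", "Define fine-tuning."])
print(to_jsonl(records))
```

The resulting file would then go to whatever ordinary fine-tuning setup the smaller model uses. Nothing about the training step itself changes, only where the targets come from.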