|
|
|
|
|
by Isuckatcode
736 days ago
|
|
>By fine-tuning only the adapter layers, the original parameters of the base pre-trained model remain unchanged, preserving the general knowledge of the model while tailoring the adapter layers to support specific tasks. From a ML noob (me) understanding of this, does this mean that the final matrix is regularly fine tuned instead of fine tuning the main model ? Is this similar to how chatGPT now remembers memory[1] ? [1] https://help.openai.com/en/articles/8590148-memory-faq |
|
The advantage of the adaptor matrices is you can have different sets of adaptor matrices for different tasks, all based of the base model.