| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by baptiste1 972 days ago
	Yes, your understanding is correct. However, instead of adding a head on top of the network, most fine-tuning is currently done with LoRA (https://github.com/microsoft/LoRA). This introduces low-rank matrices between different layers of your models, those are then trained using your training data while the rest of the models' weights are frozen.