| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by markusheimerl 5 days ago
	Sure it could be extended to support LoRA finetuning but this implementation has the goal to be as lean and efficient as possible for a pre-training stack as you can be.