Hacker News new | ask | show | jobs
by Nischalj10 980 days ago
what is the best way to fine-tune these models? any good resources would be very helpful. TIA /\

PS - I have a brief background in Machine Learning, more in development.

1 comments

Jeremy Howard talks about it in his recent video "A Hackers' Guide to Language Models": https://youtu.be/jkrNMKz9pWU?t=4808

That link goes directly to the timestamp where he discusses fine tuning, but the whole talk is great. Punchline, check out Axolotl: https://github.com/OpenAccess-AI-Collective/axolotl

This is a walkthrough based on that talk for fine-tuning with axolotl https://dzlab.github.io/dltips/en/pytorch/llama-2-finetuning...