Hacker News new | ask | show | jobs
by jdecourval 804 days ago
I have experience working with Large Language Models (LLMs) during my tenure at my previous company. However, one aspect I didn't get the opportunity to explore there was fine-tuning my own LLM, so I decided to experiment.

While there are many pieces of documentation available on this topic scattered across the Internet, putting them all together is not obvious. Therefore, in this repository, I present the full training code, from scrapping, to generating the dataset to the finetuning itself.

The whole thing takes a few days to complete, and it does work on my AMD GPU!