Show HN: I built a website where you can easily fine-tune Llama 3.1 models

Great point! Fine-tuning is the lowest level in the hierarchy of LLM control. Initially, people try prompting and that works well for a lot of use cases. When prompting is insufficient or the LLM hallucinates too much then you would resort to RAG. RAG is essentially doing a vector search over some data and then feeding the results into the prompt so that the LLM has the relevant context. This can significantly reduce hallucinations. However, RAG has its limitations. Perhaps you want the model to learn a particular behaviour (or get extremely good at it) or you want to fine-tune a much smaller (and therefore much cheaper) model to get very good at something (as good as a much bigger and smarter model), in these cases you would explore fine-tuning.

So currently I'm mainly targeting researchers or startup founders who are at the cutting edge of AI as they will likely want to explore fine-tuning.

I suspect that fine-tuning will become much more ubiquitous over time.

But thank you for the great question