Hacker News new | ask | show | jobs
by Akashic101 934 days ago
I am currently planning my own small LLM trained on documents we use internally for work. Does anyone have any tips and tricks on how to make this work the best? Could a project like Llamafile help me with this, even if it is just for testing?
3 comments

How did you choose between training a model from scratch vs using retrieval augmented generation with an existing off-the-shelf model? From what I've observed, RAG + off-the-shelf model seems to be the more common approach for use cases like "create LLM that answers questions about my company's internal documentation", particularly because the iteration/improvement cycle is much shorter-- it's much easier to iterate on RAG/prompts vs. training a whole new model to improve it. (If the answer is "I just wanted to try training a whole new llm", I won't fault you for that! :) )
I would like to be able to train an LLM on absolutely everything I have starred at Github.
Seconded. My copy launched without problem - now I want to learn how to ingest my library of blog posts, PDFs & images.