Hacker News new | ask | show | jobs
by jamifsud 1096 days ago
Are there communities where one can go to learn more about fine tuning and running these things? I've found a bunch for diffusion models but haven't had any luck with LLMs.
4 comments

It's not a community per se but there's a lot of research and discussion going on directly in the llama.cpp repo (https://github.com/ggerganov/llama.cpp) if you're interested in the more technical side of things.
Sorry for self-promotion but I wrote something on this today: https://adamfallon.com/ai/llms/deep-learning/machine-learnin...
The discord servers for a few of the projects are relatively popular. Most have a help channel you could post in if you have questions. The Discord for KoboldAI has some developers from koboldcpp ,which is the easiest and one of the most bleeding edge way of running these models locally. It builds on llamacpp and allows the use of different front ends among other things like using k quantized models. People also have had success with using something like runpods.

Native fine tuning is still out of consumer reach for the forseeable future, but there's people experimenting with QLORAs. The pipeline is still relatively new though and is a bit involved.

https://koboldai.org/discord

https://github.com/LostRuins/koboldcpp

There is a bit of activity on /r/localllama
Any outside of reddit?
4chans lmg thread on /g/
There's a LocalLLaMA Lemmy instance at https://sh.itjust.works/c/localllama