Hacker News new | ask | show | jobs
by tikkun 1115 days ago
I compiled this list - https://gist.github.com/TikkunCreation/5de1df7b24800cc05b482...

In particular, you'll probably want to skip to nanoGPT (https://github.com/karpathy/nanoGPT) and then maybe if you are interested in a bit more of the theory, Zero to Hero (https://karpathy.ai/zero-to-hero.html), and his comments in one of the threads linked: https://news.ycombinator.com/item?id=34414716

Fine tuning may also be a faster and better place to start, this is a good guide for fine tuning some publicly released LLMs: https://erichartford.com/uncensored-models

1 comments

Thank you!