Hacker News new | ask | show | jobs
by pama 693 days ago
Please look at any of the plain pytorch codes by Karpathy that complement llm.c. If you want scalable codes, please look at Megatron-LM.