Hacker News
by nqzero 856 days ago
Is there an existing SLM that resembles an LLM in architecture and comes with the code for training it?

I realize the cost and time to train may be prohibitive, and that quality on general English might be very limited, but is the code itself available?

1 comment

Not sure what you mean by SLM, but see https://github.com/karpathy/nanoGPT