|
|
|
|
|
by sunpazed
906 days ago
|
|
> While LLM projects typically require an exorbitant amount of resources, it is important to remind ourselves that research does not need to assemble full-fledged massively expensive systems in order to have impact. Check out TinyLlama; https://github.com/jzhang38/TinyLlama Four research students from the Singapore University of Technology and Design are pretraining a 1.1B Llama model on 3 trillion tokens using a handful of A100's. They're also providing the source code, training data, and fine-tuned checkpoints for anyone to run. |
|