|
|
|
|
|
by andy99
902 days ago
|
|
I've been using one of the earlier checkpoints for benchmarking a Llama implementation. Completely anecdotally I feel at least as good or better about this one than the earlier openllama 3B. I wouldn't use either of them for RAG or anything requiring more power, just to say that it's competitive as a smaller model, whatever you use those for, and easy to run on CPU at FP16 (meaning without serious quantization). |
|
https://github.com/rbitr/llm.f90/tree/optimize16/purefortran