by smodad
1106 days ago
I just realized that Justine was the person responsible for the massive reduction in the memory footprint of the Llama models back in March.[1] Super impressive! These are my favorite kinds of blog posts.

[1] https://github.com/ggerganov/llama.cpp/pull/613