Hacker News
by
ryanklee
896 days ago
It's an LLM technology that allows certain models to run on CPUs rather than big beefy GPUs. Makes running locally viable for consumers.
muricula
896 days ago
Is there a specific paper or something you can point me to? Or are you talking about like llama.cpp? Because I thought that referred to the fact that it was originally one C++ file named llama.cpp?
ryanklee
896 days ago
I assumed it was in reference to llama.cpp. It's a weak assumption, though.
qup
895 days ago
The guy meant CCP, I'm pretty sure.