| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ryanklee 896 days ago
	It's an LLM technology that allows certain models to run on CPUs rather than big beefy GPUs. Makes running locally viable for consumers.

1 comments

muricula 896 days ago

Is there a specific paper or something you can point me to? Or are you talking about like llama.cpp? Because I thought that referred to the fact that it was originally one c++ file named llama.cpp?

link

ryanklee 896 days ago

I assumed it was in reference to llama.cpp. It's a weak assumption, though.

link

qup 895 days ago

The guy meant CCP, I'm pretty sure.

link