HN Mirror



	by always2slow 1038 days ago
	>That's a rather odd comparison to make. First of all, OP, like llama.cpp, doesn't use the GPU

	When was the last time you looked at llama.cpp? It has supported GPU, GPU+CPU, and distributed inference using OpenMPI for a while now. It also supports training, as well as negative prompting and grammars! The ease of getting llama.cpp running on just about anything has already sparked innovation.