| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Eisenstein 58 days ago
	If the card supports vulkan and the model has gguf weights. llamacpp has excellent vulkan support that is being actively developed and is not that far behind CUDA where speed is concerned. * https://github.com/ggml-org/llama.cpp/releases