Pure Go hardware accelerated local inference on VLMs using llama.cpp

Y	Hacker News new \| ask \| show \| jobs

	Pure Go hardware accelerated local inference on VLMs using llama.cpp (github.com)
	1 points by deadprogram 223 days ago