HN Mirror

Hacker News


	by summarity 1059 days ago
	If you're running on Ampere, llama.cpp is probably not ideal. llama.cpp is optimized for ARM, but Ampere has native acceleration for workloads like this: https://cloudmarketplace.oracle.com/marketplace/en_US/adf.ta...