| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by tarruda 650 days ago
	Have you ran the model in full FP16? It is possible a lot of performance is lost when running quantized versions.