


	by Gcam 851 days ago
	As part of our benchmarking of Groq, we asked them about quantization, and they assured us they are running models at full FP16. It's a good point and important to check. Link to benchmarking: https://artificialanalysis.ai/ (Note: the question concerned their API rather than their chat demo.)