| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by woadwarrior01 1785 days ago
	Yes. Non tensor core fp16 ops are the default. Tensor cores are essentially 4x4 fp16 mac units and there's a requirement that matrix dimensions are multiples of 8[1] that needs to be met for them to be used. [1]: https://docs.nvidia.com/deeplearning/performance/mixed-preci...