Y
Hacker News
new
|
ask
|
show
|
jobs
by
woadwarrior01
1739 days ago
Yes. Non tensor core fp16 ops are the default. Tensor cores are essentially 4x4 fp16 mac units and there's a requirement that matrix dimensions are multiples of 8[1] that needs to be met for them to be used.
[1]:
https://docs.nvidia.com/deeplearning/performance/mixed-preci...