Thankfully for a good chunk of number crunching that works fine. But the other side of the coin is notably AI workloads. There's no OpenCL or Vulkan standard for exposing matrix units, only vendor specific ones.
For OpenCL: cl_qcom_ml_ops (Qualcomm) notably,
for Vulkan: VK_NV_cooperative_matrix (NVIDIA)
For OpenCL: cl_qcom_ml_ops (Qualcomm) notably, for Vulkan: VK_NV_cooperative_matrix (NVIDIA)