Hacker News new | ask | show | jobs
by namibj 2951 days ago
I just wanted to caution that one has to be careful what one is comparing against, as the libraries got significant speed improvements over time, without that being widely advertised. So it matters a lot if one compares this library against CuDNN from 1 month ago, or to CuDNN from 2 years ago. The latter is _much_ slower.

The GEMM example was just there as the details of the optimization have been published, unlike most other hand-tuned assembler routines for DNN workloads.

1 comments

CuDNN v7 was used in the experiments, in the experiments parts each comparison was listed with version or commit number.
Well, I didn't RTFA. This was not meant to be specific for this article though.