|
|
|
|
|
by masahi
2954 days ago
|
|
The TVM results on resnet50 and mobilenet seem a bit off. On GTX 1070 Ti, with an input of size (1, 3, 224, 224) TVM result Resnet50 : 100 inference/sec (0.009983 sec per each run) Mobilenet: 450 inference/sec (0.002220 sec per each run) PlaidML result Resnet50 : 107 inference/sec (0.009302 sec per each run) Mobilenet: 473 inference/sec (0.002112 sec per each run) My benchmark script for tvm is here
https://gist.github.com/masahi/a386c2ce5b5f8c2d9f7af5e09a8d8... |
|
If I just pull the overall kernel runtime from our logs, I get ~525 inferences/sec.