Y
Hacker News
new
|
ask
|
show
|
jobs
by
ipiszy
1360 days ago
You could check "AITemplate optimizations" section in the blog (
https://ai.facebook.com/blog/gpu-inference-engine-nvidia-amd...
), and
https://github.com/facebookincubator/AITemplate#more-about-a...
. The basic idea is to do aggressive kernel fusions.