Hacker News new | ask | show | jobs
by upbeat_general 1361 days ago
Do you know of any good explanations of the techniques you used for those who only touch PyTorch Eager + occasionally torchscript?
1 comments

You could check "AITemplate optimizations" section in the blog (https://ai.facebook.com/blog/gpu-inference-engine-nvidia-amd...), and https://github.com/facebookincubator/AITemplate#more-about-a.... The basic idea is to do aggressive kernel fusions.