by Langdal 3348 days ago
Note that they only compared their solution with Eigen’s tensor transposition functionality. HPTT is not a drop-in replacement.
1 comment

Does anyone know how much time a typical TensorFlow model spends in transposition routines?
I have no hard numbers, but faster transposition should help long-sequence LSTM networks, because TensorFlow has to transpose between time-major and batch-major layouts between steps.
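
To make the layout change concrete, here is a minimal NumPy sketch of a batch-major to time-major transposition. It uses NumPy rather than TensorFlow or HPTT, and the sizes and variable names are made up for illustration; it only shows what data movement is involved, not how any particular library implements it.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
batch, time, features = 4, 6, 3

# Batch-major layout: [batch, time, features].
batch_major = np.arange(batch * time * features, dtype=np.float32).reshape(
    batch, time, features
)

# Swapping the first two axes gives a time-major view: [time, batch, features].
# NumPy only permutes strides here; no data has moved yet.
time_major_view = np.transpose(batch_major, (1, 0, 2))

# Materializing a contiguous time-major buffer is the actual memory copy,
# i.e. the work a tensor-transposition routine has to do.
time_major = np.ascontiguousarray(time_major_view)

# Transposing back recovers the original batch-major tensor.
round_trip = np.ascontiguousarray(np.transpose(time_major, (1, 0, 2)))
```

The copy in `np.ascontiguousarray` touches every element once, so its cost grows with batch size times sequence length times feature width. That is why the overhead would plausibly matter most for long sequences.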