However, they completely messed up the benchmarking experiments for the various RNN models, whose own papers claim performance comparable to, or even better than, the base transformer.