Is deepmind moving back to (py)torch?

I'm not a Googler/Deepminder, so I can't comment about what goes on inside the company.

However, Google is making a hardware investment in Tensor Processing Units. These presumably offer hardware acceleration for the static computation graphs that TensorFlow produces, and PyTorch wouldn't be any good with them.

You're right that--as of this time of writing--there are no good seq2seq with attention models in Keras. I think there are a few attempts on Github, but I haven't tried them yet. I don't know anybody else that has tried seq2seq w/ attention in Keras yet either.

Additionally, TensorFlow has a seq2seq module, and it does come with an attention mechanism. See https://github.com/google/seq2seq/blob/master/seq2seq/models...

Anyway, I think the best thing for folks like you and me is to just keep using PyTorch for research work, and use TensorFlow for certain deployment scenarios where TensorFlow is superior--like mobile apps and Google Cloud.