|
|
|
Is deepmind moving back to (py)torch?
|
|
4 points
by UCAN2
3192 days ago
|
|
Hi,
I am working on natural language entailments tasks with keras but despite some searching around I couldn't find any convincing example of an seq2seq with attention model in Keras (or a neural cache model for that matter)
On the other hand I just looked at pytorch for a couple of hours and dynamic/control flow operations seems really easy to code in pytorch .
Is everyone progressively moving to pytorch while only using tensorflow as a backend for keras ? Is the pytorch movement even spreading inside google ? |
|
However, Google is making a hardware investment in Tensor Processing Units. These presumably offer hardware acceleration for the static computation graphs that TensorFlow produces, and PyTorch wouldn't be any good with them.
You're right that--as of this time of writing--there are no good seq2seq with attention models in Keras. I think there are a few attempts on Github, but I haven't tried them yet. I don't know anybody else that has tried seq2seq w/ attention in Keras yet either.
Additionally, TensorFlow has a seq2seq module, and it does come with an attention mechanism. See https://github.com/google/seq2seq/blob/master/seq2seq/models...
Anyway, I think the best thing for folks like you and me is to just keep using PyTorch for research work, and use TensorFlow for certain deployment scenarios where TensorFlow is superior--like mobile apps and Google Cloud.