|
|
|
|
|
by greato
3255 days ago
|
|
Using tf.scan is a bad idea. scan implements strict semantics so it will always execute the same number of timesteps no matter what the accumulator is (nan). while_loop implements dynamic execution (quit once cond is not met) and at the same time allows parallel execution when some ops are not dependent on accumulator. If you read the code for `dynamic_rnn` and contrib.legacy Seq2seq model you'll find while_loop. I have yet to see tensorflow library code using tf.scan anywhere! Also, internally, scan is defined using while_loop. In my code, I find scan lacking in RNN and always have to fall back to while_loop. Here is video of a talk by the RNN/Seq2Seq author himself: https://youtu.be/RIR_-Xlbp7s?t=16m3s |
|
The difference from using tf.while_loop directly is that tf.scan handles the logistics of an accumulator to keep track of hidden states, so you don't have to implement that piece yourself.
As you say, tf.scan uses tf.while_loop internally; it's not particularly different from something you might build using tf.while_loop yourself.