From what I can gather, the supervised approach lets you learn a representation of only the style, rather than of which digit it is. The only reason to use one over the other is to demonstrate that it works, I guess?
Though, in the finest academic tradition, once you try to actually run it, you'll find that it silently depends on a separate library written by the author, which you'll have to find yourself.
I thought I had removed all the dependencies on 'tensorcv'. It turns out I forgot the dataflow part. It should now run without 'tensorcv'. Thanks for pointing that out.
And if it's just being used for the MNIST dataset, is there a particular reason for using it one way or the other?