For people interested in a cleaned-up, commented and de-obfuscated word2vec, I recently ported the original C code to Python [1].
My HN submission of this endeavour received no love, but I think it's worthwhile nevertheless: the Python code is not only more concise, readable and extensible, but training is actually faster too [2].
Mikolov said that he hoped word2vec would "significantly advance the state of the art" of NLP, but really the state of the art can only advance when people can understand and manipulate the code. You're making that possible. Thank you.
That is some amazing work, thanks!