Hacker News
by agnokapathetic 4736 days ago
Google Translate uses statistical machine translation [1] seeded from a gigantic automatically curated parallel corpus of similar documents.

As "lorem ipsum" is a typographic placeholder, a page still containing it and the filled-in version of that page appear to have the same document structure (HTML), and would therefore be statistically likely candidates as translatable pairs.
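To make the "same document structure" idea concrete, here is a rough sketch of how two pages could be scored as a candidate pair by comparing only their HTML tag sequences. This is my own illustration, not Google's actual pipeline: the function names and the sample pages are made up, and using difflib's ratio as the similarity measure is an assumption.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Records the order of opening tags and ignores the text content."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def tag_sequence(html):
    """Return the list of opening tag names in document order."""
    collector = TagCollector()
    collector.feed(html)
    collector.close()
    return collector.tags


def structure_similarity(html_a, html_b):
    """Score in [0, 1]; 1.0 means the two tag sequences are identical."""
    return SequenceMatcher(None, tag_sequence(html_a), tag_sequence(html_b)).ratio()


# Hypothetical pages: a template with placeholder text, the same template
# filled in with real copy, and an unrelated page.
placeholder = "<html><body><h1>Lorem ipsum</h1><p>Dolor sit amet.</p></body></html>"
filled_in = "<html><body><h1>Our company</h1><p>We build things.</p></body></html>"
unrelated = "<html><body><ul><li>One</li><li>Two</li></ul></body></html>"

print(structure_similarity(placeholder, filled_in))
print(structure_similarity(placeholder, unrelated))
```

A pair miner working along these lines would see the placeholder page and its filled-in twin as structurally identical, which is how placeholder text could plausibly end up aligned with real sentences.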

[1] http://www.youtube.com/watch?v=y_PzPDRPwlA