Hacker News new | ask | show | jobs
by fps_doug 1488 days ago
That happens if you have no language hints, or the wrong one, e.g. posting in Simplified Chinese on a Taiwanese website. If this was written in something like MSWord by CCP officials, it should have the proper language hint, so render properly on any OS newer than XP.
1 comments

I'm not sure what you mean by language hints.

Setting aside all other assumptions you make about the soundness of their setup overall, consistency of their input methods, newness of their inventory, etc., do you actually believe they would have the fonts with traditional glyphs in them installed and used at all? What for? Remember, this applies to the system as a whole. A character would be shown as simplified by the system even during input.

Again, I tried it and I got different results in different software (even on a Mac), with Pages in particular showing only simplified characters and straight layout (in contrast to Quick Look, which is what the tweeter must have used). Do you seriously think CCP officials have a fleet of Macs to check document appearance in case they are leaked and/or scan documents for "enemy" Unicode points? If not, how they would even know what code points are there, if all they ever see is simplified?

One needs to look at vocabulary, word choices and such. That is something that could actually point to fakery. Nothing like that was claimed yet, of course.

> I'm not sure what you mean by language hints.

Because of the han unification, you can tell the font renderer which language context you're in and want things to be rendered. MSWord shows you the language in the status bar at the bottom, which is not only used for spell checking. In html, you can add the lang attribute to a tag to tell the browser what language the contained glyphs belong to.

> do you actually believe they would have the fonts with traditional glyphs in them installed and used at all? What for?

Because ever since Vista, these come pre-installed regardless of your locale.

> Do you seriously think CCP officials have a fleet of Macs to check document appearance in case they are leaked and/or scan documents for "enemy" Unicode points?

I don't believe anything in particular, just adding technical context. The documents could also have been leaked through Taiwan or Hong Kong and then mangled there resulting in this.

> In html, you can add the lang attribute to a tag to tell the browser what language the contained glyphs belong to.

Well, you can visit probably any official Chinese government department website right now and see traditional Japanese characters instead of their simplified equivalents, if your machine happens to be configured that way. (Or at least the first one I stumbled across was like that, I pasted link somewhere in another comment. And I most certainly have Chinese fonts installed; in fact I see only simplified characters when I open the tweeted document in Pages.)

So they clearly do not make that effort even with documents actually crafted with foreign readers in mind. Presumably things can't be expected to be better if we are discussing secret documents intended for internal CCP consumption.

Right, and a premise of the tweet cited here is that different character sets mean you should just freely assume it's fabricated by Taiwan. It doesn't make that argument (at least not anywhere in the cited thread), it just presents an examination of characters with that as an underlying assumption.