Hacker News new | ask | show | jobs
by sbov 5321 days ago
Looks kinda like a localization failure. If you have e.g. Chinese in UTF-8 but mess up the output (or input) somewhere it tends to look something like that.
1 comments

It looks like it's GB18030, HZ or GBK. At least those are the ones that render with no unknown characters. I don't read/speak any Chinese dialects so I've got no idea which two encodings are likely gibberish.
GB18030 is a superset of GBK. HZ is generally used on 7-bit only mediums like Usenet.