|
|
|
|
|
by ubernostrum
3200 days ago
|
|
that explanation of UTF-8 is crap. UTF-8 is beautiful quite apart from its utility, but you'd hardly know it from the article My goal was not to judge UTF-8 aesthetically, but to explain how it works and point out that it's a variable-width encoding which emphasizes its compatibility with ASCII for strings containing only code points <= U+007F. Unicode Consortium et. al. are absurdly arrogant. I would agree that Unicode as it exists today involves some historical and historic bad decisions. But again, staying off value judgments with respect to Unicode itself since the point of the article was to explain how Python now handles it internally. |
|
Apologies for being cranky. You did a great job explaining how Python now handles Unicode!
To me it was strange reading about UTF-32 first and then getting to UTF-8 from that context. It seemed to obscure the coolth and beauty of the format.
Overall a great article, sorry again for being so negative.