by chubot
1111 days ago
Mainly because Windows adopted UCS-2 and the hacky extension UTF-16 around when the superior and ASCII-compatible UTF-8 was invented. And Java and JavaScript followed Windows, and Python is constrained by it. The surrogate pairs of UTF-16 even infected JSON and thus implementations in all languages, but funny enough, encoded JSON is specified to be UTF-8, which is better but a bit confusing. Newer, sane languages like Go and Rust are more Unix-like and use UTF-8 natively. It’s basically a Windows vs Unix problem.
UTF-8 was publicly presented at USENIX at the beginning of '93. Not sure when Unicode incorporated it.
It is unlikely that Windows would have been changed at the last minute to use it, especially as the variable-length encoding of UTF-8 was significantly more complicated than the fixed-size UCS-2.
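
To make the variable-length vs fixed-size point concrete, here is a minimal Python sketch. The helper name `encoded_sizes` and the sample characters are my own choices for illustration, not anything from the thread. It shows that UTF-8 uses 1 to 4 bytes per code point, that UTF-16 uses 2 bytes until a code point falls outside the Basic Multilingual Plane and needs a surrogate pair, and that Python's `json.dumps` with its default `ensure_ascii=True` writes such a character as two `\u` escapes.

```python
import json


def encoded_sizes(ch: str) -> dict:
    """Return the byte length of a single character in UTF-8 and UTF-16.

    Hypothetical helper for illustration only. "utf-16-le" is used so
    that no byte-order mark is added to the byte count.
    """
    return {
        "utf8_bytes": len(ch.encode("utf-8")),
        "utf16_bytes": len(ch.encode("utf-16-le")),
    }


# ASCII, Latin-1 range, rest of the BMP, and a code point beyond the BMP.
samples = ["A", "é", "€", "😀"]
for ch in samples:
    print(f"U+{ord(ch):04X}", encoded_sizes(ch))

# With the default ensure_ascii=True, a non-BMP character is escaped
# as a UTF-16 surrogate pair inside the JSON text.
print(json.dumps("😀"))  # → "\ud83d\ude00"
```

Only the last sample takes 4 bytes in UTF-16, which is the case a fixed 2-byte UCS-2 could not represent at all.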