|
|
|
|
|
by Thorrez
1528 days ago
|
|
It doesn't directly help UTF8, since all the bytes it uses for encoding non-ASCII have the high bit set. It might directly help with UTF16, I'm not sure. But the general idea of "block only a few specific characters (\0 and /) and allow all the rest" does help with UTF8. If the designers said something like "only ASCII letters and number and dashes and underscores" then that would block UTF8, and we might end up with something like URL hostnames, where you use punycode to encode non-ASCII into ASCII. |
|