|
|
|
|
|
by Kenji
3498 days ago
|
|
Unicode URLs are the devil. Too many indistinguishable characters. URLs should stay full ASCII imho. And I say that as someone whose language requires non-ASCII symbols. Or, in Bruce Schneier's words: "Unicode is just too complex to ever be secure." |
|
You really need to support this 'sub café {} café()' => Undefined subroutine café in your friendly and social programming language, otherwise you will be accused of discrimination. Of course the two é are not normalized.
Which unicode-friendly language does really check for mixed script confusables? Java only is my guess. Even perl6 falls into this trap.
http://unicode.org/reports/tr39/#Mixed_Script_Confusables