by account42
1232 days ago
AFAIK, overlong encodings are not valid UTF-8 [0], which means the canonical encoding of each Unicode code point is clearly defined: there is only one valid encoding. That still leaves higher-level Unicode shenanigans, but every Unicode encoding is going to have those. And of course, what applications accept in practice is an entirely different issue.
[0] https://en.wikipedia.org/wiki/UTF-8#Overlong_encodings
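To make the overlong point concrete, here is a minimal sketch using Python's built-in strict UTF-8 decoder. The helper name `is_valid_utf8` is made up for illustration, and this only shows what CPython's decoder does, not what any other application accepts.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes under Python's strict UTF-8 decoder."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# U+002F ("/") has exactly one valid UTF-8 encoding: the single byte 0x2F.
slash = "/".encode("utf-8")

# Overlong forms pad the same code point into more bytes than necessary.
overlong_slash_2 = b"\xc0\xaf"      # 2-byte overlong form of U+002F
overlong_slash_3 = b"\xe0\x80\xaf"  # 3-byte overlong form of U+002F
overlong_nul = b"\xc0\x80"          # overlong NUL, as used by "Modified UTF-8"

for candidate in (slash, overlong_slash_2, overlong_slash_3, overlong_nul):
    print(candidate, is_valid_utf8(candidate))
```

Only the single-byte form is accepted; the decoder refuses the padded variants, so there is no second byte sequence that decodes to the same code point.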
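That's part of why it's a good idea to normalize input before password hashing, as an example: canonically equivalent strings can differ byte for byte, so the same passphrase typed on two devices could otherwise hash differently. It will likely become more common over time to use emoji as passphrase input.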
1. https://en.wikipedia.org/wiki/Unicode_equivalence
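A rough sketch of what normalizing before hashing could look like, using only the Python standard library. The function names, the choice of NFC, the fixed salt, and the iteration count are all assumptions for illustration; a real system would use a random per-user salt and pick its normalization form deliberately.

```python
import hashlib
import unicodedata


def prepare_password(password: str) -> bytes:
    """Normalize to NFC (an assumed choice), then encode as UTF-8."""
    return unicodedata.normalize("NFC", password).encode("utf-8")


def hash_password(password: str, salt: bytes) -> bytes:
    """Hash the normalized password with PBKDF2-HMAC-SHA256."""
    # The iteration count is illustrative, not a recommendation.
    return hashlib.pbkdf2_hmac("sha256", prepare_password(password), salt, 100_000)


# Two canonically equivalent spellings of the same visible text.
composed = "caf\u00e9"     # "é" as one precomposed code point
decomposed = "cafe\u0301"  # "e" followed by a combining acute accent

demo_salt = b"fixed-demo-salt"  # demo only; use a random per-user salt in practice

raw_bytes_match = composed.encode("utf-8") == decomposed.encode("utf-8")
hashes_match = hash_password(composed, demo_salt) == hash_password(decomposed, demo_salt)
print(raw_bytes_match, hashes_match)
```

Without the normalization step the two spellings encode to different bytes and would produce different hashes; with it, both inputs end up as the same byte string before hashing.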