|
|
|
|
|
by ciprian_craciun
1232 days ago
|
|
UTF-8 is both "binary" and "text" at the same time. :) It's binary because the UTF-8 standard states how each Unicode code-point (i.e. character) is to be translated into a series of bytes. But, because each (correct) UTF-8 byte sequence can be translated back into a Unicode code-point sequence, you can see it as text also. :) (BTW, from my knowledge, UTF-8 doesn't specify a canonical encoding of Unicode text, thus for cryptographic purposes, especially signatures, perhaps one should treat it with care.) |
|
[0] https://en.wikipedia.org/wiki/UTF-8#Overlong_encodings