|
|
|
|
|
by Dylan16807
2799 days ago
|
|
> My understanding was that utf8 was crafted specifically so that the only null byte in it was literally NUL. Correct. > That all normal human language described by a utf8 string will never contain a NUL. Correct. > If you have embedded NULs, it's not really utf8, is it? Incorrect. NUL is a valid character. If you accept arbitrary utf-8, or arbitrary ascii, or arbitrary 8859-1, then there might be embedded NUL. You can filter them out if you want, but they're not invalid. |
|