Hacker News new | ask | show | jobs
by amluto 2858 days ago
> a lot of UTF-8 codecs in the real world actually no longer contain the code for handling the longer sequences.

Not exactly. A conforming decoder MUST reject them.

MySQL’s problem is that, by default, it can’t even handle all valid code points.

1 comments

They reject them by not having a code path that successfully decodes them.