by _obviously 825 days ago
Unicode is Turing complete, which makes it an attack vector.

It is a set of glyphs and their encodings. How is that 'Turing complete'?
Unicode involves more than the set of glyphs and their encodings; it also involves character properties, etc. However, it can be an attack vector even ignoring that stuff; it does not have to be Turing-complete to be an attack vector. The specific kinds of attack depend on the application.
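To make that concrete, here is a minimal sketch of two well-known application-level problems: strings that differ but collapse to the same text under NFKC normalization, and bidirectional control characters that reorder what is displayed. It uses only Python's standard `unicodedata` module. The helper names `has_bidi_controls` and `collide_after_nfkc` and the sample strings are made up for illustration, and a real check would probably need to cover more than this.

```python
import unicodedata

# Bidirectional formatting characters (embeddings, overrides, isolates).
# They are invisible but can change the order in which text is displayed.
BIDI_CONTROLS = {
    "\u202a", "\u202b", "\u202c", "\u202d", "\u202e",
    "\u2066", "\u2067", "\u2068", "\u2069",
}


def has_bidi_controls(text: str) -> bool:
    """Return True if the text contains any bidi formatting character."""
    return any(ch in BIDI_CONTROLS for ch in text)


def collide_after_nfkc(a: str, b: str) -> bool:
    """Return True if two different strings share the same NFKC form."""
    nfkc_a = unicodedata.normalize("NFKC", a)
    nfkc_b = unicodedata.normalize("NFKC", b)
    return a != b and nfkc_a == nfkc_b


# Hypothetical inputs: a name starting with FULLWIDTH LATIN SMALL LETTER A,
# and a file name containing RIGHT-TO-LEFT OVERRIDE.
fullwidth_name = "\uff41dmin"
tricky_filename = "invoice\u202egnp.exe"

print(collide_after_nfkc("admin", fullwidth_name))
print(has_bidi_controls(tricky_filename))
```

Whether either case matters depends on the application: a login system that compares names before normalizing and a file manager that renders names verbatim are exposed in different ways.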

Different kinds of character sets and character encodings are good for different purposes. Unicode is "equally bad" for many uses.

Yes, Turing complete wasn't the right term; it would be better to say that it's surprisingly complex to parse.
Yes, Unicode is too complicated and too messy, whether or not it is Turing-complete (it is complicated enough that maybe it is Turing-complete; I don't know).
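A small illustration of the kind of complexity being discussed, as a sketch rather than a full account: the same visible character can be written as one code point or as two, and the two forms differ in equality, length, and byte count until they are normalized. This uses only Python's standard library; the variable names are just for the example.

```python
import unicodedata

composed = "\u00e9"      # "é" as a single code point
decomposed = "e\u0301"   # "e" followed by COMBINING ACUTE ACCENT

# The two strings render the same but are not equal as code point sequences.
print(composed == decomposed)                       # False

# Code point counts differ.
print(len(composed), len(decomposed))               # 1 2

# UTF-8 byte counts differ as well.
print(len(composed.encode("utf-8")),
      len(decomposed.encode("utf-8")))              # 2 3

# After NFC normalization the two forms compare equal.
print(unicodedata.normalize("NFC", decomposed) == composed)  # True
```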