Hacker News new | ask | show | jobs
by 3pm 759 days ago
Good paper on UTF-8 validation performance: https://arxiv.org/pdf/2010.03090

    The relatively simple algorithm (lookup) can be several times faster than conventional algorithms at a common task using nothing more than the instructions available on commodity processors. It requires fewer than an instruction per input byte in the worst case.