Hacker News new | ask | show | jobs
by lifthrasiir 1385 days ago
Just in case, here "Unicode routines" refer to Unicode encoding routines, not other more complex things like normalization. They are pretty regular but their automata would be relatively simple. (By the way, SIMD-accelerated regular expression parsing is indeed a thing [1], if anyone wondered.)

[1] https://www.microsoft.com/en-us/research/wp-content/uploads/...

2 comments

Intel has an implementation of this technique here as well:

https://github.com/intel/hyperscan