by MattPalmer1086
761 days ago
Interesting, thanks for the detailed response. I'll have a look at the benchmark; by coincidence, I'm doing some work on algorithm benchmarking right now. I'd say a long pattern is more like 64 bytes (the benchmarking suite I use defines short patterns as 32 bytes or under). Edit: I'll also check out the frequency approach used in ripgrep; it sounds fascinating.
|
https://blog.burntsushi.net/ripgrep/
https://blog.burntsushi.net/regex-internals/
64 bytes is decently long, but I'd still put my money on SIMD for "common" cases.