by DigitalTurk 4806 days ago

Very nice work!

It seems like this is only fast on large files, though, because the text has to be copied from main memory to the GPU, which introduces latency. I wonder what the latency would be like if this algorithm were instead run on the kind of unified memory architecture that you see in e.g. the PS4 and Xbox One.

Also, I don't quite follow why they're compiling the finite automata on the GPU. To me, their explanation that they didn't want to copy the automaton node by node suggests there's a lot of room for optimization here. E.g. maybe the regular expressions could be compiled to OpenCL code.

Then again, they also found that pattern matching is a memory-bound problem, so maybe emitting native code is pointless. Does anyone know if there are regular expression engines that emit native x86 code?
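
To make the "compile the automaton to code" idea concrete, here is a minimal sketch in Python rather than OpenCL or x86. It is not taken from the linked work. The DFA (for the regex `(a|b)*abb`) and the names `match_table`, `compile_dfa`, and `match_compiled` are all made up for illustration. The first matcher walks a transition table; the second generates source code with the transitions baked in as branches, which is roughly what a regex JIT does at a much lower level.

```python
# Hypothetical DFA for the regex (a|b)*abb: state -> {input char: next state}.
DFA = {
    0: {"a": 1, "b": 0},
    1: {"a": 1, "b": 2},
    2: {"a": 1, "b": 3},
    3: {"a": 1, "b": 0},
}
START, ACCEPTING = 0, {3}


def match_table(text):
    """Interpret the transition table: one lookup per input character."""
    state = START
    for ch in text:
        state = DFA[state].get(ch)
        if state is None:  # no transition for this character: reject
            return False
    return state in ACCEPTING


def compile_dfa(dfa, start, accepting):
    """Generate Python source with the transitions hard-coded, then exec it.

    Assumes dfa has at least one state.
    """
    lines = ["def match(text):", f"    state = {start}", "    for ch in text:"]
    for i, (state, edges) in enumerate(dfa.items()):
        lines.append(f"        {'if' if i == 0 else 'elif'} state == {state}:")
        if not edges:  # dead state: any further input rejects
            lines.append("            return False")
            continue
        for j, (ch, nxt) in enumerate(edges.items()):
            keyword = "if" if j == 0 else "elif"
            lines.append(f"            {keyword} ch == {ch!r}: state = {nxt}")
        lines.append("            else: return False")
    lines.append(f"    return state in {set(accepting)!r}")
    namespace = {}
    exec("\n".join(lines), namespace)
    return namespace["match"]


match_compiled = compile_dfa(DFA, START, ACCEPTING)
```

Both matchers should agree on every input. Whether the generated version is actually faster is a separate question, and if matching really is memory bound, the difference may be small.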

dbaupp 4806 days ago

Newer versions of PCRE have a JIT: http://en.wikipedia.org/wiki/Perl_Compatible_Regular_Express...

simonster 4806 days ago

V8, SpiderMonkey, and jsc also all have regular expression JITs.

bdash 4806 days ago

SpiderMonkey uses JavaScriptCore's regular expression JIT, YARR (Yet Another RegExp Runtime).

cglace 4806 days ago

I believe PyPy also has a regex JIT.

kingkilr 4806 days ago

Yup we do.
