|
|
|
|
|
by slack3r
2670 days ago
|
|
Lol, my comment was restricted to SIMD, which definitely does not take 10 minutes. Secondly, no mainstream compiler actually compiles code to the PCMPISTRI instruction. Presumably it was meant to be used directly as assembly. I'm not sure why you are bringing in this obscure instruction into the discussion of superoptimizers. I personally have a paranoid fantasy where NSA/GCHQ introduced this instruction to speed up password cracking. :D |
|
In any practical vectorized tight inner-loop, the block you're trying to optimize is inherently going to be large. Superoptimization is exponential in the size of the block being optimized, which limits its utility. That was my entire point: it becomes unacceptably expensive way too quickly to get used in compilers. (Some of the code I'm looking at right now has 100s of instructions in a single basic block, definitely not atypical for a compiler).