|
|
|
|
|
by strictfp
4074 days ago
|
|
Nice article. It inspired me to look around for some more straightforward way of optimizing, and I found the setcc class of instructions: http://www.nynaeve.net/?p=178 I'm thinking that this combined with some CAS (CMPXCHG8B) could acheive the same, right? Something like (pseudo): Comparewith(4) Ifequalstore(54) Ifnotequalstore(2) Return |
|
If you have a lot of data to process, using SSE/AVX is a huge win. Conditional masking and min/max instructions for example.
SIMD is a huge win especially in sorting, you can have 10-40x speed-up by using a bitonic sorting network.