Hacker News new | ask | show | jobs
by brutos 3280 days ago
AVX are vector instructions. They work on wide register (256bit) where you can stuff 8 32bit floats (or in later versions ints) or 4 (double, or longs not sure if AVX2 or later has support for that), or 16 chars (AVX2 iirc).

APL, as an array language, should, in theory at least, especially profit from automatic vectorization by the compiler. If the auto-vectorization works well, you get get a 4-16x speed-up for free (if you don't have data dependencies).

1 comments

Well that sounds amazing!