by 37ef_ced3
1514 days ago
AVX-512 is an elegant, powerful, flexible set of masked vector instructions that is useful for many purposes, for example low-cost neural net inference (https://NN-512.com). Suggesting that Intel and AMD should instead make "existing REPB prefixes a lot faster" misses the big picture. The masked compression instructions (one of which is used in Lemire's article) are endlessly useful, not just for stripping spaces out of a string!
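To make the space-stripping case concrete, here is a rough sketch of how a masked compress store can be used. This is my own illustration, not necessarily the code from Lemire's article. The function name `strip_spaces` is made up, the vector path assumes a compiler and CPU with AVX-512 BW and VBMI2 (GCC or Clang, since it uses `__builtin_popcountll`), and a plain scalar loop handles the tail and any build without those extensions.

```c
#include <stddef.h>
#include <stdint.h>

/* The vector path is compiled only when the needed extensions are enabled,
   e.g. with -mavx512bw -mavx512vbmi2 or a suitable -march. */
#if defined(__AVX512F__) && defined(__AVX512BW__) && defined(__AVX512VBMI2__)
#include <immintrin.h>
#define HAVE_VBMI2 1
#else
#define HAVE_VBMI2 0
#endif

/* Hypothetical helper: copy n bytes from src to dst, dropping every ' ' byte.
   Returns the number of bytes written. dst must have room for n bytes. */
size_t strip_spaces(const char *src, size_t n, char *dst)
{
    size_t i = 0, out = 0;
#if HAVE_VBMI2
    const __m512i spaces = _mm512_set1_epi8(' ');
    for (; i + 64 <= n; i += 64) {
        __m512i block = _mm512_loadu_si512((const void *)(src + i));
        /* Bit k of keep is set when byte k of the block is not a space. */
        __mmask64 keep = _mm512_cmpneq_epi8_mask(block, spaces);
        /* Write only the kept bytes, packed contiguously, to dst + out. */
        _mm512_mask_compressstoreu_epi8(dst + out, keep, block);
        out += (size_t)__builtin_popcountll((unsigned long long)keep);
    }
#endif
    /* Scalar tail; also the whole job when the vector path is not built. */
    for (; i < n; i++) {
        if (src[i] != ' ')
            dst[out++] = src[i];
    }
    return out;
}
```

The same mask-then-compress pattern works for any per-byte predicate (filtering, packing selected lanes, and so on), which is the point about it being useful beyond whitespace removal.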
|
NN-512 is cool. I think the Go code is pretty ugly but I like the concept of the compiler a lot.