| > Not much of an issue unless you actually need the performace ofcourse. Ime, simd intrinsics is everywhere in code optimized to run as quickly as possible on x86. That about half of The Benchmark Game's benchmarks uses sse proves that point. My point is that the Benchmark Game is not representative of real world code. The website says as much. Because the benchmarks use sse everywhere does not mean that most code, even perf-sensitive code will use simd everywhere. Again, if you need simd, use a nightly. There's little to no drawback there. I fixed it up to run on modern rust (https://gist.github.com/Manishearth/5fc73c405641162f0712951c..., compile with cargo build --release), and the numbers I get are: (Ranges are just what I got from 5 runs, nothing scientific) Rust: 610-630 c: 706-716 c_fast: 919? cpp_clang: 669-694 cpp_plain: 717-728 I'm on a new (i7, 16gb) Mac so I don't yet have g++ around (nor do I know how to obtain it without messing things up; I'm used to linux), everything here done with clang. Of course, this isn't an indication that Rust is faster than C. But it is an indication that it can be just as fast, and a reinforcement of my point about microbenchmarks having large error bars. Edit: On my older x86 linux laptop (with gcc): Rust: 844-987 c_fast: 808-860 (perhaps clang somehow made c_fast slower than c on the mac? shrug) c: 982-1025 cpp_plain: 977-1019 cpp_gcc: 925-947 I think I've proven my point. |
Your point is incorrect. simd is everywhere in performance sensitive code, like in memcpy, memset, strlen, strcmp, image&video decoding...
> I fixed it up to run on modern rust (https://gist.github.com/Manishearth/5fc73c405641162f0712951c..., compile with cargo build --release), and the numbers I get are:
Note that the C benchmarks are all compiled with `-g -O2`. I'm not the author of that benchmark suite and it appears whoever is has abandoned the project.
If I fix the compiler switches (-O3 obviously) and recompile, the numbers I get are:
I'm using Rust Nightly because I can't be bothered to install more than one Rust compiler.That the numbers you are getting aren't stable suggests that you are using shoddy benchmarking techniques. Try and run them with as few applications open as possible.
Here are my updates to the c_fast benchmark:
https://gist.github.com/bjourne/4599a387d24c80906475b26b8ac9...
With this c_fast's number is 532. That is a fair bit faster than Rust and I'm sure someone who has more time than me and is more skilled at optimizing C code can improve it further.
I'm compiling with: `clang -O3 -march=native -mtune=native -fomit-frame-pointer c_fast.c -o c_fast` and my cpu is an "AMD Phenom(tm) II X6 1090T Processor"