| HN Mirror

http://benchmarksgame.alioth.debian.org/play.html#contribute

> If I were to do this in C++ … I would use the C++11 std::unordered_map<> …

The thesis of my comment was that we should be surprised that any language does poorly on this benchmark, particularly ones that have similar kinds of targeting, and that if we cared about this benchmark (and I claim we don't), we should all pitch in, possibly upstream to fix various languages and their standard libraries to nail this benchmark. However, I also believe the rules of this benchmark are awkward and even flawed, and that it isn't clear to me that it is worth anyone's time to do that.

I am not lamenting that someone should spend more time on this: I am lamenting that tons of people seem to care about it at all, it is not a "microbenchmark" (as some are calling it), and I think the main lesson we can learn from it is "there is something subtely wrong, either with the implementation that was contributed for this benchmark, the language's runtime, or it's standard library", as given these rules we really should expect every language to be similarly in performance.

And so, if we all cared about this benchmark, I bet we could figure out what is going on and get every open source language down under 25s. Past that point, I think the rules are such that this isn't even a fun game much less a useful metric of anything worth measuring, and you are probably wasting your time contributing. I guess, to make this subthread go somewhere: why do you disagree?

http://benchmarksgame.alioth.debian.org/

> it is not a "microbenchmark"

Home page -- "Will your toy benchmark program be faster if you write it in a different programming language? It depends how you write it!"

igouy 3393 days ago

Whatever the motivation for your comments, they did remind me that I'd intended to have background information URLs on other pages (not just the home page).

So thanks for that!

nisa 3397 days ago

> it.unimi.dsi.fastutil.longs.Long2IntOpenHashMap?!?

That's just a library that provides data structures without boxing of int,double... in Java. This eliminates a lot of overhead.

Oh, I absolutely understand why it is interesting and fast: what I don't understand is how it satisfies the rules, as it is effectively "some random project with a hashtable". Is this particular one so famous that it should be allowed instead of java.util.HashMap? Can I just publish my C++ hashtable and then rely on it, in which case the rule makes no sense? That's why I tack on "?!".

> instead of java.util.HashMap

Not "instead of" as-well-as.

This entire subthread is clearly a digression ;p, but why do you say that? The rules were about using a built-in or standard collection: this Java implementation does not use HashMap and instead uses a random project with a better data structure for this use case. This is a loophole to bypass the restriction on writing your own hashtable: you just have to publish it first? ;P

http://benchmarksgame.alioth.debian.org/u64q/program.php?tes...

Java #6 => HashMap

Java #3 => HashMap

http://benchmarksgame.alioth.debian.org/u64q/program.php?tes...

Java #5 => HashMap

http://benchmarksgame.alioth.debian.org/u64q/program.php?tes...

Java #4 => HashMap

http://benchmarksgame.alioth.debian.org/u64q/program.php?tes...

Those are different implementations; I know those exist, and which is why I had specifically said there were multiple implementations and pointed at the "faster one", which is obviously the one most people are going to be paying attention to, and which is the one that is also most relevant as we could and probably should expect the implementations for other languages to use some crazy one-off library: it demonstrates and undermines the limitations placed on the various languages seem arbitrary enough that you can't read into these results as being indicative of the languages themselves, but simultaneously that we shouldn't even care as the fast Java entry effectively "cheated".

I am going to take a step back: you seem irked by my comments, due to the knee-jerk link to have me contribute a different implementation for C++ as well as arguing with the short reply what I really maintain is a nearly-offtopic point about the Java library in question here (both of which I am reading as slightly aggressive towards me or my comment).

Do you really like this benchmark? Are you super excited that Rust is slightly faster right now than C/C++/Java? What is going through your head when you read my comments? Do you disagree that we should expect all native-ish languages to do equally-enough well for most people given these constraints? Is it that my comment is coming off to you as "no fun allowed", as I am trying to point out that this is a meaningless competition that mostly teaches us about different things than "who is winning"?

In fact, looking at your recent comment history, I am realizing you are being super aggressive about this with everyone, and are nigh unto spamming the play.html link to everyone who tries to comment about the high-level idea of "what are we doing here". What's up?