| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by wucke13 489 days ago
	Tangentially relevant: Gernots list of benchmarking crimes. https://gernot-heiser.org/benchmarking-crimes.html

1 comments

bjackman 489 days ago

Ha, I thought this would be a really useful resource but I think the people the author is complaining about do much better than most benchmarking I see in the industry

Almost all the benchmarking results I see is just a percentage difference between two algebraic means, no statistical analysis whatsoever.

Very common interaction: QA folks say "your change degraded some of our metrics and improved some others". I know they are full of shit because it's impossible that my change improved any perf metrics. I ask for statistical details, they don't have any, this meeting was a waste of time, it will be next time too.

The fact that I get these reactions suggests that everyone else just lets each other get away with it.

link

porridgeraisin 488 days ago

Yep. The most recent example that's stuck in my head is actually much worse: they didn't even take the mean! One sample!

https://github.com/denoland/pm-benchmark

Check the run bench shell script (there's not much else in the repo anyways)

link

genewitch 488 days ago

Hey, that's perfectly valid for arguing with your friend about which one to deploy on our server, all things equal.

I do this sort of thing to see what tools are faster all the time. ripgrep, ag(silver searcher), grep, MongoDB was one we were arguing about for a while recently.

link

burntsushi 488 days ago

Which one won? :)

(I'm the author of ripgrep.)

link

genewitch 488 days ago

ripgrep, except against the full 160GB dataset, mongoDB was faster on my ryzen.

I have a lot of subtitles. I'm partially hard of hearing and partially i can't stand the way everything is mastered, so i use volume normalization (sometimes called "night mode", vizio calls it this) and subtitles to make up for the fact that the audio tracks in most things is bad.

Well a side effect of subtitles is now i have context for every video that i can search. grep was grep.

you didn't think i'd leave you hanging https://i.imgur.com/Vs5AAT7.png

some other non-statistics from that day: 15GB sorted password list, newline delimited, UTF-8 from spinningrust drive 64 seconds (~234MB/s) to make a copy of the file. ag and rg took 3.2 seconds to search the copy. I'm actually hesitant to state that grep took 52 seconds...

Thanks for replying, thanks for making me remember the great conversations we had around those topics a couple months ago, and thanks for creating ripgrep, it's my go-to for anything non-trivial!

link

burntsushi 488 days ago

Love it! That's awesome. Thank you for replying. :-)

I've occasionally wanted to put the subtitles from all of my Simpson episodes into an easily searchable format. What do you use to extract subtitles?

link