Hacker News new | ask | show | jobs
by mkup 2330 days ago
But how would Haskell version of wc compare with C version of wc running with LC_ALL=C environment variable? UTF-8 locale is much slower than C locale in coreutils, it's a well-known fact, and their Haskell version of wc is already using fixed 8-bit characters.
1 comments

wc was actually slower with LC_ALL=C as opposed to ru_RU.UTF-8 that my system normally runs with (about 10 s against 7.2 s).

Which actually raises a good question of whether I should have been comparing with that one — but that'd probably raise more questions and lead to more people accusing me of cheating in favour of Haskell.

The only way I can think of this could be true is if you made a mistake in setting an env var.
Well, counting characters is obviously different from counting bytes when the characters are UTF-8 encoded, or UTF-32.