Hacker News
by hp6 925 days ago
I think the article doesn’t provide enough arguments for why this is even an issue.

For example, what is the probability of such a character being rendered incorrectly in some standard text? Let's say a Wikipedia article.

Even more so, the argument that people don’t report this because they are “not speakers of English!” is just an assumption. Not to mention that translation applications are more than good enough for such a task.

2 comments

>Even more so, the argument that people don’t report this because they are “not speakers of English!” is just an assumption. Not to mention that translation applications are more than good enough for such a task.

Frankly, people have learned helplessness[0] about these oddities and don't think to report them when they see them, so the inference that something isn't serious just because it's not pointed out is weak.

In the first place, the proportion of software users who raise issues on GitHub or elsewhere is small, and when the devs communicate in a script that those users don't use in their daily lives, the translation apps they have at hand are not very encouraging.

[0]: https://en.wikipedia.org/wiki/Learned_helplessness

(Disclosure: I'm CJK native)

The issue is that the language is not accurately represented – imagine in English instead of the Latin letter "a" you see the Greek letter "α". It's still legible but it's not unreasonable to ask for an accurate depiction of a language.
The letter “a” has a frequency of ~8%, and it would indeed be annoying, but let's say the letter “q”, which has a frequency of ~0.1%, is rendered incorrectly; then that's just a minor issue.
While in a practical sense your argument might hold water (which I doubt, to be honest), this is not just a practical matter. This is also a matter of respect. If you think that misrepresenting q is not a problem because it appears so rarely, then do you also think that disregarding the religious tenets of minorities is fine since they make up such a small part of the population?
I don't know why we shouldn't strive to have computers reproduce language precisely, since computers are how we communicate most of the time and how the majority of content ends up being preserved. After all, maybe we shouldn't bother with spelling in English either, since everyone can safely approximate the meaning anyway!