Hacker News new | ask | show | jobs
by bane 4546 days ago
I think natural language designers might also look at the letter frequencies and question why 'E' shows up so much. Is the canonical sound it makes just common in English or is there some problem with its "design"? It turns out E is way overloaded in English:

- it's silent in the case of modifying preceding vowels separated by a medial consonant e.g. hat vs. hate, bat vs. bate

- and in older English (or English that wants to feel old) was a superfluous final letter e.g. olde, pubbe

- as a silent letter entirely e.g. eagle

- as itself e.g. egg, education

- as a silent or nearly silent suffix separator for -ed e.g. dropped, judged

- as a non-silent suffix for -ed e.g. educated

- silent as an immediate vowel modifier in vowel digraphs (in some spellings) e.g. archaeology, encyclopaedia, caesar used to be ligatured it was so incidental.

- silent as a modifier on itself e.g. teen, feel

- one of several representation for schwa, ə e.g. taken (takən), enemy (enəmy)

etc.

'e' is a mess. It's mostly silent, either ignored completely or modifying something else (an issue even Benjamin Franklin tried to solve through a proposed spelling reform). It's conflated with schwa (the most common vowel sound in English yet has no singular representation).

A language reformer would probably tackle this letter first and fix a great deal of the spelling problems in English.

1 comments

"Natural language designer" is a contradiction; one of the core defining properties of natural languages (like English) is that they are not designed.

You switched to "reformer" in your closing sentence, perhaps that was what you originally meant, too?

Of course, such a reform is not exactly easy to implement.

I mean natural language as "language for humans to use to communicate with each other" as opposed to programming language as "language for humans to use to communicate with computers". It's the same meaning as is used in NLP.

e.g. https://en.wikipedia.org/wiki/Hangul https://en.wikipedia.org/wiki/Cyrillic and I guess even https://en.wikipedia.org/wiki/Klingon_language

This is different than the meaning of https://en.wikipedia.org/wiki/Natural_language and https://en.wikipedia.org/wiki/Constructed_language

I guess if you want to get pedantic a better term might be "Orthographic design".