|
> I’d like to give you a _tour_ of my _doubts_ _about_ this, but the _courier_ has just arrived with _four_ _doughnuts_ (and their _colour_ is popular in Britain) Right, those are the exceptions I mentioned. There's more, but even in that list, those are not singular exceptions, they're different patterns. They only stop being patterns when, as you did, match the shortest subsequence and not the longest subsequence. Even in the exception list, you have patterns: about and doubt rhyme. If you're Canadian, they also rhyme with dough. four and the 'cour' in courier rhyme. colour and honour rhyme. If we're using regexes, for example, we match the longest subsequence, not the shortest, so "ough" is the pattern in "dough", not "ou". Then it rhymes with though, furlough. The examples I gave, like `tion` as a suffix, should have been clear that I meant the matching the longest pattern (otherwise it would be matched as 'ti' and 'on'). > We get used to these things at an early age, but compared to many other languages, English is highly irregular. Sure, but I didn't dispute that, I contended that 90%+ of common usage is pattern recognition, like doubling of consonants, words ending in `e`, or starting with `in`, etc. An english reader encountering `shibboleth` for the first time will pronounce it correctly, and I claim that that is true for 90% of words in common english usage, because even the simplest words have differing patterns and so readers are forced to learn pattern recognition as a very basic and foundational part of english. It is not as dire as phrase "English is highly irregular" would suggest. To my mind, a highly irregular language would have at least half the words following no pattern, for example rhyming "moot" with "dad". Examples of non-patterns like that are, to my knowledge, not in english. I mean, you could claim that "caught" and "court" are pronounced exactly the same, and I'd point out that both are parts of larger patterns - 'taught', 'aught' and 'caught' are a pattern, while 'court', 'pour', 'rigour' are a different pattern, hence they are both examples of patterns, they just not in the same group of patterns. Look at your final example - lower and tower: lower, grower, mower are all part of one pattern. tower, bower and shower are all also part of a pattern, but it's a different pattern to the previous pattern. You will not, in english, easily spell a word that is not part of some pattern[1]. [1] Although, if you're up to the challenge, I welcome examples of spelling that is not part of any pattern ... :-) |
In many languages though, the irregulars are at single digit percentage - sometimes even zero.
And there are easily some that are not part of a pattern: “colonel” (pronounced “kernel”). American “herbal” (pronounced “erbal”), autophagy (with the emphasis on “to”, unlike any other word that starts with “auto”).
And there are ambiguous ones which in fact fit multiple patterns - e.g. “route”, british more like “flute”, American like “house”. Not to mention to-mate-o to-ma-to and either. And injured vs insured.
I don’t think anyone whose first language is regular (like German, or Japanese) would agree with your claim that English is not highly irregular.
If you need an order of magnitude more patterns to properly pronounce words (and you do) it’s a difference in quality, not just quantity.