Hacker News new | ask | show | jobs
by mcswell 219 days ago
The word 'word' is polysemous, and also vague. Polysemous: it can refer to written words or spoken words (or signed words, in sign languages).

Vague: does it mean all the inflected forms of a word, or just the stem without inflection? Example: is (are?) 'walk', 'walked', 'walks' and 'walking' four words, or one? What about "stand/stood"? (And languages where the bare root or stem can't appear by itself, like verbs in Spanish.) Derived words, like 'push' and 'pushy'.

Do compound nouns count as a word, or do only the parts count? Example: 'doghouse' (or 'dog house'). What about idioms? Example: 'to crane his/her/my/your/our neck(s)'.

What about different pronunciations? Is 'roof' pronounced to rhyme with 'aloof' the same word as 'roof' pronounced with the vowel of 'put'? And different spellings but same pronunciation: 'bear' vs. 'bare'.

What about words with different grammatical categories, like 'push' as a noun ("I gave her a push") or a verb ("I pushed her"). Or the same word with virtually unrelated meanings, "I pushed her on the swing" vs. "I pushed my ideas."