| I thought this was an interesting idea. I rated each word in the comment for how much I felt it represented grammar vs semantics (total adding to 1 for each word; ratings in increments of 0.1). The ratings divided into 31.5 words worth of syntax and 37.5 words worth of semantics, adding up to 69 instead of 71 because I combined "a lot" and "as well" into one word each for this purpose. French accounted for 6% of the grammar (reflecting my rating of sure and just as 90% "grammatical" each), and 22% of the semantics. English got 91% of the grammar and 59% of the semantics. The point you might be most likely to disagree with is that I rated many prepositions as 50% semantic. (For example, to in the phrase thirty to forty got that rating, although to in get exposed to and something to read up on were rated 0% semantic.) The second point, cutting in the other direction, is that I rated all pronouns as 0% semantic; realistically they should rate a bit higher. In a better model, I'd probably like to rate them 100% grammatical and also ~30% semantic. (The residual ~3% of grammar is the passive marker get, from Norse.) If this is the kind of thing you enjoy, I'd be interested in your evaluation. |
At a first pass I'd just say that adjectives, nouns, and adverbs are "vocabulary", and everything else is grammar.