| That won't work as a first pass. That gets you results like "there's also a lot of influence from French" being 2/3 semantics and 1/3 grammar†, with there holding just as much semantic content as influence does. It also disqualifies pronouns from counting as grammar at all, which is much more defensible than disqualifying semantically empty words, but not a common perspective. I tend to take the perspective that if a foreign speaker is unlikely to have any trouble learning how to use a word correctly, that word is semantic, and otherwise, the word is grammatical. †Assuming that the omission of verbs from your list of semantic words was a mistake. Otherwise you're up to 44% grammar. I did count "is" as being grammar, but I would certainly not extend that judgment to all verbs. --- results --- By your standard, English is 61% of the semantics and 91% of the grammar (if verbs have no semantics), or 62% of the semantics and 96% of the grammar (if verbs do have semantics). French is 21% of the semantics and 6% of the grammar (if verbs have no semantics), or 20% of the semantics and 4% of the grammar (if verbs do have semantics). I don't think much of your methodology, but it's worth noting that your overall numbers are almost identical to mine. (When verbs are meaningless; still very close but distinguishable otherwise.) In reality, of course, many verbs such as sharing are rich in semantics, and many others such as do are more or less empty. |
But I think of pronouns as grammatical, as well as the auxiliary particles in verb forms like "there is", "to go to", etc. So "have" and "is" can function grammatically when they're part of the verb form of another root verb, like "have been seen" and so on.
"Do" is obviously semantic when it's the main verb, e.g. "I'm doing my job" versus "I'm leaving my job". In the selection you quoted it's also playing a grammatical role which is just to point to the main verb form of the sentence, i.e. it could be replaced by repeating "get exposed to (titbits from)" without changing the meaning of the sentence.
So in "there is also a lot of influence from French", I would put "there _ also a _ of _ from _" as grammatical.
I'm sure my way is naive, but it's based I think on well-established categories. I'm not sure how linguists would distinguish grammatical words or even if they categorize based on words at all. e.g. "a lot of" as a quantifier might be completely grammatical, same as "more", "less", "thirty", etc.