That’s what I was referring to in the first sentence: you can see the raw text from the speech system change afterwards. Normally that’s things like punctuation and ambiguous words like their/they’re. That secondary process felt like a system which operates on test tokens because “racist” and “Mamaroneck” don’t sound similar at all.