|
|
|
|
|
by lachlan_gray
1152 days ago
|
|
Since when we talk, our tongues tap patterns on the roof of the mouth and the back of the teeth, I wonder if AI processing could infer what words you are shaping from these sensors. Maybe it’s possible to input text by mouthing words silently, but without opening your mouth. Kind of like how it’s possible to eavesdrop from just the sound of keyboard clicks: https://github.com/ggerganov/kbd-audio |
|
There are a number of other dimensions however that are equally important in the creation of word-sounds (e.g., whether the lips are pursed, whether the vocal folds are vibrating, whether the teeth make contact with the lips, where the tongue is located in the space of the mouth [for vowels], etc) and would make determination just from the dental/palatal axes pretty difficult I think. But maybe with enough context, you could get something predictive that is more than good enough, even if it's not into deterministic territory