Hacker News new | ask | show | jobs
by matthew-wegner 4144 days ago
I think there is a real danger that people are modifying/learning how to speak to computer voice recognition software. If voice recognition can't quickly become able to parse natural language, it will inevitably have to parse "I'm talking a dumb computer" cadence and inflection instead.

Incremental improvements are very bad in this regard.

These things are hard to reverse, too (people still speak with a very distinct "I'm speaking on a telephone" cadence today).

1 comments

IMHO, people (and hence language) will always adapt in certain ways to get the message across. People already learned how to "google" and expect the same style of search queries to be effective elsewhere. When speaking on the telephone, people tend to slightly change their voice to counteract the channel noise (with acoustic consequences such as increased fundamental frequency ["pitch"], etc.). I would be surprised if a similar adaptation didn't happen for human-computer voice interaction, which would ultimately help making it work well enough to be useful. (Of course, using speech recognition to transcribe human-to-human interaction will still be barely usable..)