|
|
|
|
|
by icc97
2954 days ago
|
|
> AI doesn't actually work all that well yet. (Neither Siri nor Alexa truly pass a conversational Turing test...) When we pass the Turing test it means we've got actual AI. But I'm not sure there'll ever be a clear line. So Duplex kind of passes it in a very narrow context. Whether or not the person at the end of the line was actually fooled or not is a slightly different question. They could have just been humouring what they figured was a weird automated system. But it's not like Google won't improve exponentially with this. They've now got a basic AI conversation system that they hope people will use and feed it data of actual conversations. So Duplex v2 will have an expanded system where they can handle ten times the number of scenarios and questions. The more I think about it, the more impressive it seems. Most attempts at a Turing test are text only where the subject is supposed to be a 13yo immigrant boy. Here Google's jumping straight to voice conversations. |
|
I can imagine in the future there will be some kind of approximate conversational AI rating analogous to Flesch-Kincaid for text.
But I left a problem off my list, which is that we unconsciously demand AI should be better than average human performance.
If you monitor your conversations with people, you'll find there are regular misses where one person either mishears words, doesn't understand what's said, or misinterprets a subtext.
We cut human conversations a lot of slack. We're used to thinking of humans as independent agents, and there are social conventions about asking for more information and admitting - or sometimes denying - mistakes.
But there's an unconscious expectation that AI should operate at a better-than-human level before it's considered reliable.
We're more likely to think "Stupid machine!" if something isn't understood than we would with a human. So AI will have to cross the Uncanny Turning Valley before we really trust it. And because we're dealing with automated interpretations of human agency, errors will be harder to forgive.
You can already see this with driverless cars, where any accident is considered a failure. Even though statistically an AI may be much safer than the average human, it's not considered good enough unless it can deal with situations that an average human would have no hope of dealing with.