|
|
|
|
|
by rck
4823 days ago
|
|
It looks like the research paper this is based on is here: http://schererstefan.net/assets/files/scherer_etal_FG2013.pd... I've only skimmed it, but the vision work looks sound, and it looks like it makes pretty essential use of the depth map from the Kinect. But I don't think there's any speech recognition going on, so that part is just acting (from both the humans and the virtual platform). I'll bet an untrained user could get the system to break pretty quickly... Neat proof of concept, though. |
|
Reminds me of a certain someone asking another certain someone why he flipped a tortoise in the desert...