Hacker News new | ask | show | jobs
by visarga 3559 days ago
A clip on camera you can take with you while jogging or in a museum, and it talks to you? It would be pretty embarrassing to be seen with it. This would work out better in professional settings, such as in a hospital, providing help to doctors.
3 comments

I think it depends on the interaction. If it behaves more like a toy than a tool, I could see it being kind of embarrassing. Then again, imagine just how annoying it would be to have people walking around in a museum saying "Asteria, tell me about this painting." and then the device blurts out some description from Google without using an inside voice.
> and then the device blurts out some description from Google without using an inside voice.

It would sound like an American tourist by doing that.

(There’s lots of articles from US expats in europe, or european expats in the EU, showing how US-Americans tend to speak a lot louder than Europeans in quiet settings, from museums to restaurants)

This leads to an interesting question: Which culture should a voice assistant follow? Should there be multiple variants of each assistant?

Detecting and adapting to volume isn't that big of a challenge in comparison to natural language processing. But if you mean something more subtle, like discretion or taboo...that's probably much harder than NLP.
Yes, and also adapting to mental concepts of different things.

That starts with phrases, but also applies to other concepts – different cultures have different orientation systems even (some use cardinal directions (north, east, south, west), some use relative directions (front, right, back, left), etc)

I've only heard of one example of the cardinal directions being used as a main orientation reference in day-to-day language: http://www.nytimes.com/2010/08/29/magazine/29language-t.html...

I never read the research about this but examples are http://anthroweb.ucsd.edu/~jhaviland/Publications/ETHOSw.Dia... and http://pubman.mpdl.mpg.de/pubman/item/escidoc:66622:3/compon... (very interesting stuff!).

Edit: I would highly recommend reading the 2nd paper (which includes some practical experiments testing how Guugu Yithimirr speakers thought about and remembered spatial positions and orientations). It's astonishing.

The thing is, it doesn’t stop there.

Even colors depend on cultures heavily. The ancient greek are believed not to have seperated between yellow and green, other cultures similar.

Internationalization is a lot harder than it seems to most people. And then there’s also accessibility.

Even with traditional UIs where everything is hand made it’s already an extremely huge task, but a conversational UI is far more personal.

It has to deal with things like how much privacy or directness is expected in cultures, with taboos, it has to have a full perceptional model of the person who will hear it to be able to properly handle all this.

The idea would be that the Asteria device learns to interact with you in social norms, like how you would act. And with that, some locations are better than others, but it really evolves to converse with you in your daily life, to help fill your needs.
This is your startup, I'm not sure why you're speaking in the hypothetical "would be." You either know what your product does, or it doesn't.
Vision ≠ product. Everything I'm seeing here sounds like a vision.
Perhaps the conspicuousness of wearing and being talked to by such a device is a positive thing. Shocked observers will make negative or bewildered remarks about the proposition, but some who hear about it will be receptive, all without the company taking out a single advertisement.

(Maybe.)

> while jogging or in a museum, and it talks to you

That's what museum tour guides, human or automated, do today.