Hacker News new | ask | show | jobs
by unityByFreedom 3246 days ago
> An international A.I. mission focused on teaching machines to read could genuinely change the world for the better — the more so if it made A.I. a public good, rather than the property of a privileged few.

> author: Gary Marcus is a professor of psychology and neural science at New York University.

Not sure what he has in mind. There are already a lot of smart people building Q&A systems. We need tests to establish if a system can read. Once you have those then you can throw a competition up on Kaggle with a big purse.

2 comments

No systems can really understand what they read or translate yet. They are basically sophisticated pattern matching systems.

Check out Winograd Schema: https://en.wikipedia.org/wiki/Winograd_Schema_Challenge

Overview by an expert: http://www.cs.nyu.edu/faculty/davise/papers/WinogradSchemas/...

An example: The city councilmen refused the demonstrators a permit because they [feared/advocated] violence.

When you switch between "fear" and "violence", the meaning of 'they' change. There are many more examples like this.

The best performance in the first round of the 2016 challenge was 58% by a neural network based system. Random guessing would yield 44% (some questions had more than 2 choices). Human performance was 90.89% with a standard deviation of 7.6%.

Here are the challenge problems used in the first round: http://www.cs.nyu.edu/faculty/davise/papers/WinogradSchemas/...

Human Subject Test Performance: http://www.cs.nyu.edu/faculty/davise/papers/WinogradSchemas/...

It's important to note that Winograd Schemas don't really test if the system understands those sentences, they essentially test the system has appropriate "common sense" knowledge/experience about how our world and society works, i.e., it tests whether the system understands whatever other data sources are usable to find out about this topic.

To give the proper answer in the example you use, a human (or a system) needs to know how such permits are issued and what are the common reasons for refusing such permits. As such, a sufficiently sophisticated pattern matching system is perfectly sufficient to answer such questions - there's a simple pattern difference that fearing violence causes you to refuse permits but advocating violence causes you to get refused. It's worth thinking about where do humans learn this? For the Winograd schemas like putting a trophy in suitcase, it's the basic childhood experience of putting stuff in boxes that we all share, but a machine won't (unless it's raised as a child-robot). For schemas like this one, it's understanding how our society works learned by participating in our society for years, which we all share, but a machine won't (unless we allow machines to participate in our society). I.e. it's not so much a measure of intelligence as a measure of shared background experiences. A human from a hunter-gatherer tribe wouldn't be able to answer the councilman-permit schema, but that doesn't mean he/she isn't intelligent.

The difficulty there is caused mainly by the need to have domain-specific knowledge in a wide range of domains - we will perceive systems as "dumb" unless they share the same background knowledge that most humans have gained by being part of our society and basic schooling, and since the machines won't do that (yet), we're looking for "unnatural" ways of getting common sense knowledge without the direct experimentation and participation that we do.

Winograd Schema and bAbI seem like good intermediate goals?
Winograd Schema yes, bAbi no. bAbi is a nice dataset for understanding how different kinds of algorithm perform, but it is trivially solvable.