|
|
|
|
|
by dpandya
3153 days ago
|
|
It seems that the primary contribution of this technique is that it uses specific assumptions supported by neuroscience research in order to allow for composability of learning and better generalization. By introducing these specific assumptions (e.g. contours define objects), they are able to reduce the complexity that the model has to learn and thereby reduce the amount of data that it needs. Obviously, the question then becomes: what happens when you have visual situations that violate or come close to violating the assumptions made? I'm not familiar enough with the specifics of RCNs to be able to answer this; maybe someone else can. Very interesting paper and approach regardless. |
|
I haven't read it but skimming, I could see that there definitely were no formulas in it at all . Which sort of says, at best what it tells you is "we did this thing, which is kind of like X and kind of like Y with Z changes". Essentially, no way to reproduce or understand by itself. The first reference then had a link behind a paywall...
So despite lots of apparent explanation, it seems like what they're actually doing is essentially unspecified (at least to the interested layman). It seems like at best an expert in the field of "compositional models" could say what is happening.
Also, the paper is published under the heading of an AI firm Fremont, ca rather than folks in a university, with the many authors listed by initial and last name...
PDF for the curious:
http://science.sciencemag.org/content/sci/early/2017/10/26/s...
Edit: tracked down that apparently has some "real" math. Whether is even what the OP is doing remains to be seen.
https://staff.fnwi.uva.nl/t.e.j.mensink/zsl2016/zslpubs/lake...