|
|
|
|
|
by flor1s
3155 days ago
|
|
I only skimmed over the article, but I think the title on HN does not reflect the claims the authors are making. The title of the paper is: A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs The title of the article is: Common Sense, Cortex, and CAPTCHA That's nowhere near the sensationalist title at HN: RCN is much more data efficient than traditional Deep Neural Networks |
|
Learning from few examples and generalizing to dramatically different situations are capabilities of human visual intelligence that are yet to be matched by leading machine learning models. By drawing inspiration from systems neuroscience, we introduce a probabilistic generative model for vision in which message-passing based inference handles recognition, segmentation and reasoning in a unified way. The model demonstrates excellent generalization and occlusion-reasoning capabilities, and outperforms deep neural networks on a challenging scene text recognition benchmark while being 300-fold more data efficient. In addition, the model fundamentally breaks the defense of modern text-based CAPTCHAs by generatively segmenting characters without CAPTCHA-specific heuristics. Our model emphasizes aspects like data efficiency and compositionality that may be important in the path toward general artificial intelligence.