|
I'm curious to know what you mean by "labelled information". I'm guessing that what you are calling "labelled information" is various forms of encouragement or discouragement that could be considered positively and negatively "labelled" examples. If that is the case, linguistics research back in the '70s found that infants get almost no negative examples of, in particular, language. For example, a parent will not correct a child by saying, "no you can't say 'eated' because then you could also say 'sitted'". Instead they will correct by saying "no, you should say 'ate'" etc. That is important because there was a famous proof in inductive inference (the precursor to computational learning theory) that languages higher in the Chomsky hierarchy than regular languages cannot be learned from positive examples alone. And yet, babies eventually learn to speak human languages, which are assumed to be at least context-free. Chomsky used these findings to support his claim of a "universal grammar" or innate language endowment [1]. If you are talking about multi-class labelling, that's even harder to imagine. In machine learning, a multi-class classifier will map inputs to some set of categorical labels (i.e. a set of integers) but the mapping from those labels to concepts that a human would recognise, such as 1:cat, 2:bat, 3:hat, etc, must be perormed manually, because the classifier and humans do not have a shared understanding of what e.g. "cat", "bat" and "hat" mean. The classifier only knows 1,2,3... etc, the human knows that "1 means cat". How would this lack of shared context be resolved between an adult and a baby, so that the adult could provide "multi-class labels"? ___________ [1] Sorry that I don't have any references for all this handy- I can try to dig some up if you're interested, but you could start by reading the wikipedia page on Language Identification in the Limit, which is about the famous result from inductive inference I mention (also known as Gold's result from the man who derived it): https://en.wikipedia.org/wiki/Language_identification_in_the... |
IMHO, modern cognitive science based approaches (such as by Tomasello and others) have a better chance of explaining how language is acquired than the hypothesising of the 70s.
I don't have time now to go into more references, but the question is far from settled.
[1] https://scottthornbury.wordpress.com/2015/06/07/p-is-for-pov...