DocumentCode
1503947
Title
Spatio–Temporal Multimodal Developmental Learning
Author
Zhang, Yilu ; Weng, Juyang
Author_Institution
Gen. Motors Global R&D, Warren, MI, USA
Volume
2
Issue
3
fYear
2010
Firstpage
149
Lastpage
166
Abstract
It is elusive how the skull-enclosed brain enables spatio-temporal multimodal developmental learning. By multimodal, we mean that the system has at least two sensory modalities, e.g., visual and auditory in our experiments. By spatio-temporal, we mean that the behavior from the system depends not only on the spatial pattern in the current sensory inputs, but also those of the recent past. Traditional machine learning requires humans to train every module using hand-transcribed data, using handcrafted symbols among modules, and hand-link modules internally. Such a system is limited by a static set of symbols and static module performance. A key characteristic of developmental learning is that the “brain” is “skull-closed” after birth - not directly manipulatable by the system designer - so that the system can continue to learn incrementally without the need for reprogramming. In this paper, we propose an architecture for multimodal developmental learning - parallel modality pathways all situate between a sensory end and the motor end. Motor signals are not only used as output behaviors, but also as part of input to all the related pathways. For example, the proposed developmental learning does not use silence as cut points for speech processing or motion static points as key frames for visual processing.
Keywords
biomechanics; brain; hearing; learning (artificial intelligence); medical computing; neurophysiology; spatiotemporal phenomena; visual perception; auditory modality; cut points; machine learning; motion static points; parallel modality pathways; sensory inputs; sensory modalities; silence; skull-enclosed brain; spatial pattern; spatio-temporal multimodal developmental learning; speech processing; visual modality; visual processing; Contracts; Humans; Indium tin oxide; Joining processes; Machine learning; Permission; Spatiotemporal phenomena; Speech processing; Speech recognition; Uncertainty; Developmental architecture; multimodal development; speech recognition; visual recognition;
fLanguage
English
Journal_Title
Autonomous Mental Development, IEEE Transactions on
Publisher
ieee
ISSN
1943-0604
Type
jour
DOI
10.1109/TAMD.2010.2051437
Filename
5473115
Link To Document