Title :
Spatio–Temporal Multimodal Developmental Learning
Author :
Zhang, Yilu ; Weng, Juyang
Author_Institution :
Gen. Motors Global R&D, Warren, MI, USA
Abstract :
It is elusive how the skull-enclosed brain enables spatio-temporal multimodal developmental learning. By multimodal, we mean that the system has at least two sensory modalities, e.g., visual and auditory in our experiments. By spatio-temporal, we mean that the behavior from the system depends not only on the spatial pattern in the current sensory inputs, but also those of the recent past. Traditional machine learning requires humans to train every module using hand-transcribed data, using handcrafted symbols among modules, and hand-link modules internally. Such a system is limited by a static set of symbols and static module performance. A key characteristic of developmental learning is that the “brain” is “skull-closed” after birth - not directly manipulatable by the system designer - so that the system can continue to learn incrementally without the need for reprogramming. In this paper, we propose an architecture for multimodal developmental learning - parallel modality pathways all situate between a sensory end and the motor end. Motor signals are not only used as output behaviors, but also as part of input to all the related pathways. For example, the proposed developmental learning does not use silence as cut points for speech processing or motion static points as key frames for visual processing.
Keywords :
biomechanics; brain; hearing; learning (artificial intelligence); medical computing; neurophysiology; spatiotemporal phenomena; visual perception; auditory modality; cut points; machine learning; motion static points; parallel modality pathways; sensory inputs; sensory modalities; silence; skull-enclosed brain; spatial pattern; spatio-temporal multimodal developmental learning; speech processing; visual modality; visual processing; Contracts; Humans; Indium tin oxide; Joining processes; Machine learning; Permission; Spatiotemporal phenomena; Speech processing; Speech recognition; Uncertainty; Developmental architecture; multimodal development; speech recognition; visual recognition;
Journal_Title :
Autonomous Mental Development, IEEE Transactions on
DOI :
10.1109/TAMD.2010.2051437