Regional Information Center for Science and Technology (RICEST) - Spatio–Temporal Multimodal Developmental Learning

DocumentCode :

1503947

Title :

Spatio–Temporal Multimodal Developmental Learning

Author :

Zhang, Yilu ; Weng, Juyang

Author_Institution :

Gen. Motors Global R&D, Warren, MI, USA

Volume :

Issue :

fYear :

2010

Firstpage :

149

Lastpage :

166

Abstract :

How the skull-enclosed brain enables spatio-temporal multimodal developmental learning remains elusive. By multimodal, we mean that the system has at least two sensory modalities, e.g., visual and auditory in our experiments. By spatio-temporal, we mean that the behavior of the system depends not only on the spatial pattern in the current sensory inputs, but also on the patterns of the recent past. Traditional machine learning requires humans to train every module using hand-transcribed data, to use handcrafted symbols among modules, and to hand-link modules internally. Such a system is limited by a static set of symbols and static module performance. A key characteristic of developmental learning is that the “brain” is “skull-closed” after birth, that is, not directly manipulable by the system designer, so that the system can continue to learn incrementally without the need for reprogramming. In this paper, we propose an architecture for multimodal developmental learning in which parallel modality pathways are all situated between a sensory end and the motor end. Motor signals are used not only as output behaviors, but also as part of the input to all the related pathways. For example, the proposed developmental learning does not use silence as cut points for speech processing or motion static points as key frames for visual processing.
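
The data flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Pathway` class, the `step` function, the fixed linear weights, and the winner-take-all motor output are all assumptions made for illustration, and no learning is modeled. It shows only the two structural points the abstract states: each pathway sees a window of recent sensory frames rather than the current frame alone, and the previous motor signal is fed back as part of every pathway's input.

```python
from collections import deque


class Pathway:
    """Hypothetical modality pathway (illustrative only, not from the paper).

    Maps a window of recent sensory frames plus the previous motor signal
    to one score per motor unit, using fixed linear weights.
    """

    def __init__(self, weights, frame_size, history):
        # weights[m] scores motor unit m; each row must have
        # history * frame_size + number_of_motor_units entries.
        self.weights = [list(row) for row in weights]
        # Window of the last `history` frames, zero-filled at start-up.
        self.recent = deque(
            [[0.0] * frame_size for _ in range(history)], maxlen=history
        )

    def vote(self, frame, last_motor):
        self.recent.append(list(frame))
        # Input = recent frames (oldest first) followed by the motor feedback.
        inputs = [x for f in self.recent for x in f] + list(last_motor)
        return [sum(w * x for w, x in zip(row, inputs)) for row in self.weights]


def step(pathways, frames, last_motor):
    """Sum the votes of all parallel pathways; return a one-hot motor signal."""
    totals = [0.0] * len(last_motor)
    for pathway, frame in zip(pathways, frames):
        for i, score in enumerate(pathway.vote(frame, last_motor)):
            totals[i] += score
    # Ties resolve to the lowest-indexed motor unit.
    winner = max(range(len(totals)), key=totals.__getitem__)
    return [1.0 if i == winner else 0.0 for i in range(len(totals))]
```

With one pathway per modality, `step([visual, auditory], [visual_frame, auditory_frame], last_motor)` would be called once per time step, passing each returned motor signal back in as `last_motor` on the next call. Because the window slides continuously, nothing in the sketch relies on segmenting the input at silences or static frames.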

Keywords :

biomechanics; brain; hearing; learning (artificial intelligence); medical computing; neurophysiology; spatiotemporal phenomena; visual perception; auditory modality; cut points; machine learning; motion static points; parallel modality pathways; sensory inputs; sensory modalities; silence; skull-enclosed brain; spatial pattern; spatio-temporal multimodal developmental learning; speech processing; visual modality; visual processing; Contracts; Humans; Indium tin oxide; Joining processes; Machine learning; Permission; Spatiotemporal phenomena; Speech processing; Speech recognition; Uncertainty; Developmental architecture; multimodal development; speech recognition; visual recognition;

fLanguage :

English

Journal_Title :

Autonomous Mental Development, IEEE Transactions on

Publisher :

IEEE

ISSN :

1943-0604

Type :

jour

DOI :

10.1109/TAMD.2010.2051437

Filename :

5473115

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1503947