  • DocumentCode
    1503947
  • Title
    Spatio–Temporal Multimodal Developmental Learning
  • Author
    Zhang, Yilu; Weng, Juyang
  • Author_Institution
    Gen. Motors Global R&D, Warren, MI, USA
  • Volume
    2
  • Issue
    3
  • fYear
    2010
  • Firstpage
    149
  • Lastpage
    166
  • Abstract
    It remains elusive how the skull-enclosed brain enables spatio-temporal multimodal developmental learning. By multimodal, we mean that the system has at least two sensory modalities, e.g., visual and auditory in our experiments. By spatio-temporal, we mean that the behavior of the system depends not only on the spatial pattern in the current sensory inputs, but also on those of the recent past. Traditional machine learning requires humans to train every module using hand-transcribed data, to use handcrafted symbols among modules, and to hand-link modules internally. Such a system is limited by a static set of symbols and static module performance. A key characteristic of developmental learning is that the “brain” is “skull-closed” after birth (not directly manipulatable by the system designer), so that the system can continue to learn incrementally without the need for reprogramming. In this paper, we propose an architecture for multimodal developmental learning in which parallel modality pathways are all situated between a sensory end and the motor end. Motor signals are not only used as output behaviors, but also as part of the input to all the related pathways. For example, the proposed developmental learning does not use silence as cut points for speech processing or motion static points as key frames for visual processing.
  • Keywords
    biomechanics; brain; hearing; learning (artificial intelligence); medical computing; neurophysiology; spatiotemporal phenomena; visual perception; auditory modality; cut points; machine learning; motion static points; parallel modality pathways; sensory inputs; sensory modalities; silence; skull-enclosed brain; spatial pattern; spatio-temporal multimodal developmental learning; speech processing; visual modality; visual processing; Contracts; Humans; Indium tin oxide; Joining processes; Machine learning; Permission; Spatiotemporal phenomena; Speech processing; Speech recognition; Uncertainty; Developmental architecture; multimodal development; speech recognition; visual recognition
  • fLanguage
    English
  • Journal_Title
    Autonomous Mental Development, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1943-0604
  • Type
    jour
  • DOI
    10.1109/TAMD.2010.2051437
  • Filename
    5473115