• DocumentCode
    2648023
  • Title

    On the active perception of speech by robots

  • Author

    Kabré, Harouna

  • Author_Institution
    CLIPS-IMAG Lab., Joseph Fourier Univ., Grenoble, France
  • fYear
    1996
  • fDate
    8-11 Dec 1996
  • Firstpage
    765
  • Lastpage
    774
  • Abstract
    We describe an autonomous agent approach to automatic speech recognition which is based on the link of two models: a virtual environment model (VEM) and a virtual speaker model (VSM). The VEM is a system which can generate some synthetic signals of different wave lengths and can record real world data from a camera and a microphone. The VSM is a speech synthesis model with some controllable parameters which can be used to synthesize speech signal which varies according to the characteristics of an unknown speaker. VEM and VSM are instantiated to train artificial neural networks which extract and integrate the auditory and the visual information paths for the purpose of robust automatic speech recognition. Such an instance is called an autonomous speech recognition agent (ASRA) or equivalently a speech robot. Finally, the problem of robust automatic speech recognition in this new framework amounts to select the best ASRA for a given pair of VEM and VSM. The paper describes the simulation environment and presents the potential applications of this new model in the framework of data fusion, of ASRAs evaluation and of emerging properties of auto-adaptive systems
  • Keywords
    robots; sensor fusion; signal synthesis; speech recognition; speech synthesis; active speech perception; artificial neural networks; auto-adaptive systems; automatic speech recognition; autonomous speech recognition agent; data fusion; speech robot; speech synthesis model; synthetic signals; virtual environment model; virtual speaker model; Automatic speech recognition; Autonomous agents; Cameras; Microphones; Robotics and automation; Robots; Robustness; Signal generators; Speech synthesis; Virtual environment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multisensor Fusion and Integration for Intelligent Systems, 1996. IEEE/SICE/RSJ International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    0-7803-3700-X
  • Type

    conf

  • DOI
    10.1109/MFI.1996.572314
  • Filename
    572314