• DocumentCode
    3343102
  • Title

    Sound interval detection of multiple sources based on sound directivity

  • Author

    Ishi, Carlos T. ; Dong, Liang ; Ishiguro, Hiroshi ; Hagita, Norihiro

  • Author_Institution
    ATR Intell. Robot. & Commun. Labs., Kyoto, Japan
  • fYear
    2010
  • fDate
    18-22 Oct. 2010
  • Firstpage
    1982
  • Lastpage
    1987
  • Abstract
    Utterance interval detection is a bottleneck for the current speech recognition performance in robots embedded in real noisy environments. In the present work, we make use of sound localization technology using a microphone array, not only for localizing, but also for detecting sound intervals of multiple sound sources. In our previous work we have implemented and evaluated sound localization in the 3D-space using the MUSIC (MUltiple SIgnal Classification) method. In the present work, we proposed a method for detecting sound intervals based on the sound directivity information inferred from the dynamics of the MUSIC spectrogram. The proposed method achieved high sound interval detection accuracies and low insertion rates compared with the previous sound localization results.
  • Keywords
    humanoid robots; microphone arrays; mobile robots; signal classification; speech recognition; 3D-space; MUSIC spectrogram; high sound interval detection; low insertion rates; microphone array; multiple signal classification; multiple sound sources; real noisy environment; sound directivity information; sound interval detection; sound localization technology; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on
  • Conference_Location
    Taipei
  • ISSN
    2153-0858
  • Print_ISBN
    978-1-4244-6674-0
  • Type

    conf

  • DOI
    10.1109/IROS.2010.5652016
  • Filename
    5652016