• DocumentCode
    565470
  • Title

    Loudness measurement of human utterance to a robot in noisy environment

  • Author

    Kagami, Satoshi ; Sasaki, Yoko ; Thompson, Simon ; Fujihara, Tomoaki ; Enomoto, Tadashi ; Mizoguchi, Hiroshi

  • Author_Institution
    Digital Human Res. Center, AIST, Tokyo, Japan
  • fYear
    2008
  • fDate
    12-15 March 2008
  • Firstpage
    217
  • Lastpage
    224
  • Abstract
    In order to understand utterance based human-robot interation, and to develop such a system, this paper initially analyzes how loud humans speak in a noisy environment. Experiments were conducted to measure how loud humans speak with 1) different noise levels, 2) different number of sound sources, 3) different sound sources, and 4) different distances to a robot. Synchronized sound sources add noise to the auditory scene, and resultant utterances are recorded and compared to a previously recorded noiseless utterance. From experiments, we understand that humans generate basically the same level of sound pressure level at his/her location irrespective of distance and background noise. More precisely, there is a band according to a distance, and also according to sound sources that is including language pronounce. According to this understanding, we developed an online spoken command recognition system for a mobile robot. System consists of two key componenets: 1) Low side-lobe microphone array that works as omini-directional telescopic microphone, and 2) DSBF combined with FBS method for sound source localization and segmentation. Caller location and segmented sound stream are calculated, and then the segmented sound stream is sent to voice recognition system. The system works with at most five sound sources at the same time with about at most 18[dB] sound pressure differences. Experimental results with the modile robot are also shown.
  • Keywords
    distance measurement; human-robot interaction; microphone arrays; mobile robots; speaker recognition; DSBF; FBS method; auditory scene; background noise; caller location; human utterance; human-robot interation; language pronounce; loudness measurement; low side-lobe microphone array; mobile robot; noise level; noisy environment; omini-directional telescopic microphone; online spoken command recognition system; recorded noiseless utterance; robot distance; sound pressure level; sound source localization; sound source segmentation; sound stream segmentation; synchronized sound source; voice recognition system; Arrays; Humans; Microphones; Noise; Noise level; Noise measurement; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Human-Robot Interaction (HRI), 2008 3rd ACM/IEEE International Conference on
  • Conference_Location
    Amsterdam
  • ISSN
    2167-2121
  • Print_ISBN
    978-1-60558-017-3
  • Type

    conf

  • Filename
    6249438