Loudness measurement of human utterance to a robot in noisy environment

Author

Kagami, Satoshi ; Sasaki, Yoko ; Thompson, Simon ; Fujihara, Tomoaki ; Enomoto, Tadashi ; Mizoguchi, Hiroshi

Author_Institution

Digital Human Res. Center, AIST, Tokyo, Japan

fYear

2008

fDate

12-15 March 2008

Firstpage

217

Lastpage

224

Abstract

In order to understand utterance based human-robot interation, and to develop such a system, this paper initially analyzes how loud humans speak in a noisy environment. Experiments were conducted to measure how loud humans speak with 1) different noise levels, 2) different number of sound sources, 3) different sound sources, and 4) different distances to a robot. Synchronized sound sources add noise to the auditory scene, and resultant utterances are recorded and compared to a previously recorded noiseless utterance. From experiments, we understand that humans generate basically the same level of sound pressure level at his/her location irrespective of distance and background noise. More precisely, there is a band according to a distance, and also according to sound sources that is including language pronounce. According to this understanding, we developed an online spoken command recognition system for a mobile robot. System consists of two key componenets: 1) Low side-lobe microphone array that works as omini-directional telescopic microphone, and 2) DSBF combined with FBS method for sound source localization and segmentation. Caller location and segmented sound stream are calculated, and then the segmented sound stream is sent to voice recognition system. The system works with at most five sound sources at the same time with about at most 18[dB] sound pressure differences. Experimental results with the modile robot are also shown.

Keywords

distance measurement; human-robot interaction; microphone arrays; mobile robots; speaker recognition; DSBF; FBS method; auditory scene; background noise; caller location; human utterance; human-robot interation; language pronounce; loudness measurement; low side-lobe microphone array; mobile robot; noise level; noisy environment; omini-directional telescopic microphone; online spoken command recognition system; recorded noiseless utterance; robot distance; sound pressure level; sound source localization; sound source segmentation; sound stream segmentation; synchronized sound source; voice recognition system; Arrays; Humans; Microphones; Noise; Noise level; Noise measurement; Robots;

fLanguage

English

Publisher

ieee

Conference_Titel

Human-Robot Interaction (HRI), 2008 3rd ACM/IEEE International Conference on

Conference_Location

Amsterdam

ISSN

2167-2121

Print_ISBN

978-1-60558-017-3

Type

conf

Filename

6249438