Title :
Speech/Music Discrimination for Robust Speech Recognition in Robots
Author :
Choi, Mu Yeol ; Song, Hwa Jeon ; Kim, Hyung Soon
Author_Institution :
Pusan Nat. Univ., Pusan
Abstract :
Automatic speech recognition (ASR) is one indispensable technology to communicate with a service robot. In real-world environments, ASR faces many kinds of sound sources and they should be discriminated to improve ASR performance. In ASR systems, speech is usually detected from the input signal by voice activity detection (VAD) scheme. Speech and music, how ever, are not easily discriminated by the VAD because they share similar characteristics such as periodicity. In this paper, we adopt a speech/music discriminator into the front-end of the ASR system in order to disable music stream not to be an input for the ASR system. Our speech/music discriminator employs the mean of minimum cepstral distances (MMCD) as a feature parameter. Experimental result shows the MMCD parameter outperforms the conventional feature parameter, spectral flux.
Keywords :
robots; speech recognition; speech-based user interfaces; automatic speech recognition; minimum cepstral distances; music discrimination; robots; robust speech recognition; speech discrimination; voice activity detection; Automatic speech recognition; Cepstral analysis; Entropy; Human robot interaction; Mel frequency cepstral coefficient; Microphone arrays; Robotics and automation; Robustness; Service robots; Speech recognition;
Conference_Titel :
Robot and Human interactive Communication, 2007. RO-MAN 2007. The 16th IEEE International Symposium on
Conference_Location :
Jeju
Print_ISBN :
978-1-4244-1634-9
Electronic_ISBN :
978-1-4244-1635-6
DOI :
10.1109/ROMAN.2007.4415064