Title :
Robust speech recognition for similar pronunciation phrases using MMSE under noise environments
Author :
Watanabe, Manabu ; Tsutsui, H. ; Miyanaga, Yoshikazu
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
Abstract :
In this paper, we propose a robust speech recognition method for similar pronunciation phrases. Along with the popularization of information devices such as personal computers and smart-phones, many applications controlled by voice have spread in the society. In order to increase the speech accuracy under a real environment, it is extremely important to discriminate similar pronunciation phrases. In the proposed method, linear prediction theory (LPC) is used for spectral analysis while cepstrum mean subtraction (CMS) and dynamic range adjustment (DRA) is used for a noise reduction method. The speech accuracy was recorded 68.7 % in SNR 10 dB by using the proposed methods. In conclusion, LPC+CMS/DRA is the most effective method to discriminate similar pronunciation phrases.
Keywords :
least mean squares methods; speech recognition; LPC+CMS/DRA; MMSE; cepstrum mean subtraction; dynamic range adjustment; information devices; linear prediction theory; noise environments; noise reduction method; personal computers; robust speech recognition; similar pronunciation phrases; smart-phones; spectral analysis; speech accuracy; voice control; Cepstrum; Dynamic range; Noise; Noise reduction; Spectral analysis; Speech; Speech recognition;
Conference_Titel :
Communications and Information Technologies (ISCIT), 2013 13th International Symposium on
Conference_Location :
Surat Thani
Print_ISBN :
978-1-4673-5578-0
DOI :
10.1109/ISCIT.2013.6645971