DocumentCode
1867799
Title
Analyzing pitch robustness of PMVDR and MFCC features for children's speech recognition
Author
Ghai, Sunil ; Sinha, Roopak
Author_Institution
Dept. of Electron. & Commun. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
fYear
2010
fDate
18-21 July 2010
Firstpage
1
Lastpage
5
Abstract
The degradation in children's speech recognition performance under mismatched conditions, i.e., on models trained on adults' speech, is a well-known problem. Among several other factors, the large difference between the pitch values of adults' and children's speech contributes to this degradation. MFCC is the most commonly used feature in automatic speech recognition, but it has been reported to be affected by pitch variations across speech signals. Recently, the perceptual-MVDR (PMVDR) feature has been reported as a better alternative to MFCC under noisy conditions. It is also credited with better spectral modeling ability for high-pitch signals. Motivated by these findings, in this work we analyze the robustness of PMVDR to pitch variations across speech signals, in comparison to MFCC, for children's speech recognition under mismatched conditions. Our study finds PMVDR to be more pitch robust than MFCC when the default parameters are used. However, when the parameters are suitably adapted for children's speech recognition under mismatched conditions, PMVDR and MFCC both give significantly improved, comparable performance on children's speech and exhibit similar robustness to pitch variations.
Keywords
speech recognition; MFCC; PMVDR; adults' speech trained models; automatic speech recognition; children's speech recognition; pitch robustness; Bandwidth; Mel frequency cepstral coefficient; Pediatrics; Robustness; Smoothing methods; Speech
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing and Communications (SPCOM), 2010 International Conference on
Conference_Location
Bangalore
Print_ISBN
978-1-4244-7137-9
Type
conf
DOI
10.1109/SPCOM.2010.5560549
Filename
5560549