Title :
A comparison of composite features under degraded speech in speaker recognition
Author :
Openshaw, J.P. ; Sun, Z.P. ; Mason, J.S.
Author_Institution :
Univ. Coll. of Swansea, UK
Abstract :
A variety of features and their sensitivity to noise mismatch between the model and test noise conditions are assessed. The authors use speaker identification (SI) for a performance evaluation as it is very sensitive to feature changes, and propose a target for robustness in terms of matched noise conditions. Two primary features, mel frequency cepstral coefficients (MFCCs) and PLP, are considered along with their RASTA and first-order regression extensions. PLP-RASTA is found to give the best resilience under cross conditions for a single feature, and the linear discriminant analysis (LDA) combination of MFCC and PLP-RASTA gives the best performance overall. Only in combined training are satisfactory results for any feature found.<>
Keywords :
learning (artificial intelligence); sensitivity analysis; speech recognition; PLP; RASTA; combined training; degraded speech; first-order regression; linear discriminant analysis; mel frequency cepstral coefficients; performance evaluation; resilience; robustness; sensitivity to noise mismatch; speaker identification;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.1993.319316