DocumentCode :
1693923
Title :
Speaker identification from shouted speech: Analysis and compensation
Author :
Hanilci, Cemal ; Kinnunen, Tomi ; Saeidi, Rahim ; Pohjalainen, Jouni ; Alku, Paavo ; Ertas, Figen
Author_Institution :
Dept. of Electron. Eng., Uludag Univ., Bursa, Turkey
fYear :
2013
Firstpage :
8027
Lastpage :
8031
Abstract :
Text-independent speaker identification is studied using neutral and shouted speech in Finnish to analyze the effect of vocal mode mismatch between training and test utterances. Standard mel-frequency cepstral coefficient (MFCC) features with Gaussian mixture model (GMM) recognizer are used for speaker identification. The results indicate that speaker identification accuracy reduces from perfect (100 %) to 8.71 % under vocal mode mismatch. Because of this dramatic degradation in recognition accuracy, we propose to use a joint density GMM mapping technique for compensating the MFCC features. This mapping is trained on a disjoint emotional speech corpus to create a completely speaker- and speech mode independent emotion-neutralizing mapping. As a result of the compensation, the 8.71 % identification accuracy increases to 32.00 % without degrading the non-mismatched train-test conditions much.
Keywords :
Gaussian processes; speaker recognition; speech processing; text analysis; Finnish; GMM; Gaussian mixture model; MFCC; disjoint emotional speech corpus; nonmismatched train-test condition; speaker recognition; speaker-mode independent emotion-neutralizing mapping; speech analysis; speech mode independent emotion-neutralizing mapping; speech recognition; standard melfrequency cepstral coefficient; test utterance; text-independent speaker identification; training utterance; vocal mode mismatch; Accuracy; Joints; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; Vectors; shouted speech; speaker identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639228
Filename :
6639228
Link To Document :
بازگشت