DocumentCode :
3776542
Title :
Speaker identification with whispered speech mode using MFCC: Challenges to whispered speech identification
Author :
V. M. Sardar;S. D. Shrbahadurkar
Author_Institution :
JSPM´s Rajarshi Shahu College of Engineering, Tathawade, Pune, India. Savitribai Phule Pune University
fYear :
2015
Firstpage :
70
Lastpage :
74
Abstract :
Whispered mode of speech is preferred by people for confidential communication. To identify speaker from his whispered mode of utterances is challenging compared to neutral speech as the speaker identification rate is found to be reducing in the whispered case. This paper uses the MFCC (Mel-Frequency Cepstrum Coefficients) for feature extraction and VQ (Vector Quantization) for matching to identify speaker. Here, closed set of 35 speaker´s database is generated to test the system: both whispered and neutral utterances of all speakers are recorded. When trained with neutral speech sample and test query was also neutral, identification is 91.4%. At the same time for neutral speech trained system when the test sample was whispered, identification rate reaches as low as 68.56%. The identification of unknown speaker is based on the Euclidean distance. The GUI created in MATLAB is used for displaying the result of speaker identification. The systematic analysis is made to find the reasons of deteriorating performance in the whispered case and probable remedies are suggested.
Keywords :
"Speech","Feature extraction","Training","Filter banks","Mel frequency cepstral coefficient","Euclidean distance","Testing"
Publisher :
ieee
Conference_Titel :
Information Processing (ICIP), 2015 International Conference on
Type :
conf
DOI :
10.1109/INFOP.2015.7489353
Filename :
7489353
Link To Document :
بازگشت