DocumentCode :
2394136
Title :
Vowel-category based Short Utterance Speaker Recognition
Author :
Fatima, Nakhat ; Zheng, Thomas Fang
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
2012
fDate :
19-20 May 2012
Firstpage :
1774
Lastpage :
1778
Abstract :
The impact of Short Utterances in Speaker Recognition is of significant importance. Despite the advancements in short utterance speaker recognition (SUSR), text dependence and the role of phonemes in carrying speaker information needs further investigation. This paper presents a novel method of using vowel categories for SUSR. We define Vowel Categories (VC´s) considering Chinese and English languages. After recognition and extraction of phonemes, the obtained vowels are divided into VC´s, which are then used to develop Universal Background VC Models (UBVCM) for each VC. Conventional GMM-UBM system is used for training and testing. The proposed categories give minimum EERs of 13.76%, 14.03% and 16.18% for 3, 2 and 1 second respectively. Experimental results show that in text dependent SUSR, significant speaker-specific information is present at phoneme level. The similar properties of phonemes can be used such that accurate speech recognition is not required, rather Phoneme Categories can be used effectively for SUSR. Also, it is shown that vowels contain large amount of speaker information, which remains undisturbed when VC are employed.
Keywords :
speaker recognition; Chinese language; English language; GMM-UBM system; phonemes; short utterance; speaker information; speaker recognition; speech recognition; text dependence; universal background VC model; vowel category; Feature extraction; Hidden Markov models; Speaker recognition; Speech; Speech recognition; Testing; Training; Short Utterance Speaker Recognition; Universal Background Vowel Category Model; Vowel Categories;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems and Informatics (ICSAI), 2012 International Conference on
Conference_Location :
Yantai
Print_ISBN :
978-1-4673-0198-5
Type :
conf
DOI :
10.1109/ICSAI.2012.6223387
Filename :
6223387
Link To Document :
بازگشت