DocumentCode :
2929607
Title :
Speaker Recognition via Statistics of Acoustic Feature Distribution
Author :
Li Shaomei ; Guo Yunfei ; Wei Hongquan
Author_Institution :
Nat. Digital Switching Syst. Res. Center, Zhengzhou, China
Volume :
2
fYear :
2009
fDate :
18-20 Nov. 2009
Firstpage :
190
Lastpage :
192
Abstract :
In recent years, with the wide application of speaker recognition, processing speed has received increasing attention alongside recognition precision. The traditional recognition framework, which directly matches the acoustic feature sequence against the target speaker's model, does not work well in real-time environments, so a new recognition framework based on statistics of the acoustic feature sequence has arisen. This paper proposes a new speaker recognition algorithm that models a speaker's characteristics by the distribution of acoustic features around a common codebook. The common codebook is generated from the training data of all reference speakers and is used to classify the speech feature space; the model of each reference speaker is described by the statistics of that speaker's acoustic feature distribution around the common codebook. During recognition, pairwise sequence alignment is adopted to measure the distortion between the acoustic feature distribution of the test speech and each reference speaker model, and the recognition result is derived by comparing distortions. Experimental results show that the proposed method saves computation and storage while performing better than a current algorithm that is also based on statistics of the speaker's acoustic feature distribution.
Keywords :
acoustic signal processing; feature extraction; signal classification; speaker recognition; statistical distributions; acoustic feature distribution; acoustic feature sequence statistic; common codebook; pairwise sequence alignment; recognition precision; reference speaker model; speaker recognition; speech feature space classification; Acoustic distortion; Acoustic measurements; Acoustic testing; Distortion measurement; Loudspeakers; Speaker recognition; Speech recognition; Statistical distributions; Target recognition; Training data;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multimedia Information Networking and Security, 2009. MINES '09. International Conference on
Conference_Location :
Hubei
Print_ISBN :
978-0-7695-3843-3
Electronic_ISBN :
978-1-4244-5068-8
Type :
conf
DOI :
10.1109/MINES.2009.140
Filename :
5370132