Regional Information Center for Science and Technology - Speaker Recognition via Statistics of Acoustic Feature Distribution

DocumentCode :

2929607

Title :

Speaker Recognition via Statistics of Acoustic Feature Distribution

Author :

Li Shaomei ; Guo Yunfei ; Wei Hongquan

Author_Institution :

Nat. Digital Switching Syst. Res. Center, Zhengzhou, China

Volume :

fYear :

2009

fDate :

18-20 Nov. 2009

Firstpage :

190

Lastpage :

192

Abstract :

In recent years, as speaker recognition has become widely applied, processing speed has drawn attention alongside recognition precision. The traditional recognition framework, which directly matches the acoustic feature sequence against the model of the target speaker, does not work well in real-time environments, so new recognition frameworks based on statistics of the acoustic feature sequence have arisen. This paper proposes a new speaker recognition algorithm that models a speaker's characteristics by the distribution of acoustic features around a common codebook. The common codebook is generated from the training data of all reference speakers and is used to classify the speech feature space; the model of each reference speaker is described by the statistics of that speaker's acoustic feature distribution around the common codebook. During recognition, pairwise sequence alignment is adopted to measure the distortion between the acoustic feature distribution of the test speech and each reference speaker model, and the recognition result is derived by comparing distortions. Experimental results showed that the proposed method saves computation and storage while performing better than a current algorithm that is also based on statistics of the speaker's acoustic feature distribution.
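
The pipeline outlined in the abstract (shared codebook, per-speaker distribution statistics, distortion comparison) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not say how the codebook is trained or how the alignment-based distortion is defined, so plain k-means and an L1 histogram distance are substituted here as assumptions. All function names, the synthetic 2-D "features", and the speaker labels `spk_a` and `spk_b` are hypothetical.

```python
import random

def nearest(codebook, x):
    """Index of the codeword closest to feature vector x (squared Euclidean)."""
    return min(range(len(codebook)),
               key=lambda k: sum((a - b) ** 2 for a, b in zip(codebook[k], x)))

def train_codebook(features, size, iters=10, seed=0):
    """Plain k-means over features pooled from all reference speakers (assumed method)."""
    rng = random.Random(seed)
    codebook = [list(v) for v in rng.sample(features, size)]
    for _ in range(iters):
        buckets = [[] for _ in range(size)]
        for x in features:
            buckets[nearest(codebook, x)].append(x)
        for k, bucket in enumerate(buckets):
            if bucket:  # keep the old codeword if its cell is empty
                codebook[k] = [sum(col) / len(bucket) for col in zip(*bucket)]
    return codebook

def occupancy(codebook, features):
    """Normalized histogram: fraction of frames falling in each codeword cell."""
    counts = [0] * len(codebook)
    for x in features:
        counts[nearest(codebook, x)] += 1
    return [c / len(features) for c in counts]

def distortion(p, q):
    """L1 distance between histograms; a stand-in for the paper's alignment-based measure."""
    return sum(abs(a - b) for a, b in zip(p, q))

def identify(codebook, models, test_features):
    """Return the reference speaker whose model has the smallest distortion."""
    test_hist = occupancy(codebook, test_features)
    return min(models, key=lambda name: distortion(models[name], test_hist))

def synth(center, n, rng):
    """Synthetic stand-in for acoustic feature frames around a given center."""
    return [[rng.gauss(c, 0.3) for c in center] for _ in range(n)]

rng = random.Random(1)
train = {"spk_a": synth([0.0, 0.0], 200, rng), "spk_b": synth([3.0, 3.0], 200, rng)}
pooled = [x for feats in train.values() for x in feats]
codebook = train_codebook(pooled, size=4)
models = {name: occupancy(codebook, feats) for name, feats in train.items()}
result = identify(codebook, models, synth([3.0, 3.0], 50, rng))
```

In this sketch each speaker model is just one short histogram over the shared codebook, which is in the spirit of the computation and storage savings the abstract reports. A real system would use actual acoustic features and the distortion measure defined in the paper.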

Keywords :

acoustic signal processing; feature extraction; signal classification; speaker recognition; statistical distributions; acoustic feature distribution; acoustic feature sequence statistic; common codebook; pairwise sequence alignment; recognition precision; reference speaker model; speaker recognition; speech feature space classification; Acoustic distortion; Acoustic measurements; Acoustic testing; Distortion measurement; Loudspeakers; Speaker recognition; Speech recognition; Statistical distributions; Target recognition; Training data;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Multimedia Information Networking and Security, 2009. MINES '09. International Conference on

Conference_Location :

Hubei

Print_ISBN :

978-0-7695-3843-3

Electronic_ISBN :

978-1-4244-5068-8

Type :

conf

DOI :

10.1109/MINES.2009.140

Filename :

5370132

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2929607