DocumentCode
2349297
Title
Advance statistical modeling for speaker verification
Author
Waghmare, Rupali K. ; Asutkar, Vinayak G.
Author_Institution
Shri Guru Gobind Singhji Inst. of Eng. & Technol., Nanded
fYear
2008
fDate
3-5 June 2008
Firstpage
2535
Lastpage
2539
Abstract
Speaker verification deals with the problem of verifying whether a given utterance has been pronounced by a claimed authorized speaker. This problem is important because an accurate speaker verification system can be applied to many security applications. Fundamental frequency is the rate of vocal folds vibration during speech. It is considered to be one of the most important prosodic features to characterize speech and speaker specific patterns. This study considers two aspects: the estimation of fundamental frequency and its modeling for speaker recognition. Prosodic information has been applied in two main ways are global statistics and dynamic time warping. To understand this prosodic parameter, we present an analysis of three popular algorithms: correlation, kurtosis and kernel function. In this paper, we present a new model for speaker verification without background model, which is called correlation and kernel function method (CK method). In CK method, the correlation and un-correlation of MFCC are used to identify individuals, and a kernel function is used to work out the likelihood of two models. This method is faster than GMM method, requires fewer data to train and also less space to store the model. From experimental results we say that the kernel function has higher accuracy than correlation and kurtosis method.
Keywords
speaker recognition; statistical analysis; correlation algorithm; dynamic time warping; fundamental frequency; kernel function; kurtosis algorithm; prosodic information; speaker recognition; speaker verification; statistical modeling; Frequency estimation; Gaussian processes; Interference; Kernel; Space technology; Speaker recognition; Speech; Statistical distributions; Statistics; Testing; Kernel Function; Kurtosis; Prosodic Feature Correlation Matrix; Speaker Verification;
fLanguage
English
Publisher
ieee
Conference_Titel
Industrial Electronics and Applications, 2008. ICIEA 2008. 3rd IEEE Conference on
Conference_Location
Singapore
Print_ISBN
978-1-4244-1717-9
Electronic_ISBN
978-1-4244-1718-6
Type
conf
DOI
10.1109/ICIEA.2008.4582976
Filename
4582976
Link To Document