DocumentCode
2426492
Title
Different aspects of source information for limited data speaker verification
Author
Das, Rohan Kumar ; Pati, Debadatta ; Mahadeva Prasanna, S.R.
Author_Institution
Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
fYear
2015
fDate
Feb. 27 2015-March 1 2015
Firstpage
1
Lastpage
6
Abstract
Limited data speaker verification is significant for practical, system-oriented applications. This paper shows the importance of different aspects of the voice source feature in the limited test data scenario. A baseline speaker verification system using the conventional mel frequency cepstral coefficients (MFCC) feature is developed, and its performance under limited test data conditions (≤10 s) is evaluated. A parallel system based on the source feature mel power difference of spectrum in subband (M-PDSS) is developed in the i-vector based speaker verification framework. The two systems are fused at the score level for short segments of test speech, which demonstrates the importance of the source feature as the test data duration is reduced. The M-PDSS feature is then compared with our earlier work using the discrete cosine transform of the integrated linear prediction residual (DCTILPR) feature, and the two source features, M-PDSS and DCTILPR, are fused with the MFCC features. An absolute improvement of 5.19% is obtained for 2 s of test data, which conveys the significance of multiple source information for limited data speaker verification, since the features carry different aspects of source information.
Keywords
cepstral analysis; speaker recognition; DCTILPR feature; M-PDSS feature; MFCC feature; baseline speaker verification system; conventional mel frequency cepstral coefficients feature; discrete cosine transform of integrated linear prediction residual feature; limited data speaker verification; multiple source information; practical system oriented applications; test data condition; voice source feature; Decision support systems; Dynamic range; Feature extraction; Handheld computers; Market research; Mel frequency cepstral coefficient; NIST; DCTILPR; M-PDSS; MFCC; short utterances; source features; speaker verification
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications (NCC), 2015 Twenty First National Conference on
Conference_Location
Mumbai
Type
conf
DOI
10.1109/NCC.2015.7084846
Filename
7084846