DocumentCode :
2426928
Title :
Speaker change detection using excitation source and vocal tract system information
Author :
Sarma, Mousmita ; Gadre, Sree Nilendra ; Sarma, Biswajit Dev ; Mahadeva Prasanna, S.R.
Author_Institution :
Dept. of Electron. & Commun. Eng., Gauhati Univ., Guwahati, India
fYear :
2015
fDate :
Feb. 27 2015-March 1 2015
Firstpage :
1
Lastpage :
6
Abstract :
Speaker change information in speech arises from both the vocal tract system and the excitation source. In this work, excitation source information is extracted by computing cepstral features from the zero frequency filtered speech (ZFFS) signal, and vocal tract system information is extracted by computing cepstral features from the speech signal. Combining the speaker change evidence obtained from these two feature sets shows that they contain complementary information for speaker change detection. Two popular distance-metric-based algorithms, the Bayesian Information Criterion (BIC) and Kullback-Leibler Divergence (KLD), are used to detect the speaker change evidence. The Miss Detection Rate (MDR) of the BIC-based algorithm is 24.18% using cepstral features from speech and 25.92% using cepstral features from ZFFS. When the two sets of evidence are combined, the MDR falls to 15.89%. Similarly, the individual MDRs of the KLD-based algorithm are 32.24% for speech and 45.17% for ZFFS, whereas the combination reduces the MDR to 19.67%. Experiments on noisy speech show a similar reduction in MDR. This demonstrates the usefulness of cepstral features from the excitation source signal for reducing MDR.
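The abstract names BIC and KLD as the distance measures but does not give their exact form. As a minimal sketch only, the following assumes the common formulation in which each analysis window of cepstral feature vectors is modelled by a single full-covariance Gaussian. The function names, the `penalty_weight` parameter, and the use of maximum-likelihood covariances are illustrative assumptions, not details confirmed by the paper.

```python
import numpy as np


def _ml_cov(x):
    """Maximum-likelihood covariance of x (frames x dims). Assumed estimator."""
    return np.cov(x, rowvar=False, bias=True)


def _logdet(cov):
    """Log-determinant of a covariance matrix, computed stably."""
    _, logdet = np.linalg.slogdet(cov)
    return logdet


def delta_bic(x, y, penalty_weight=1.0):
    """Delta-BIC between two adjacent windows of feature vectors.

    Positive values favour two separate Gaussians (a speaker change);
    negative values favour one shared Gaussian (no change).
    """
    n_x, n_y = len(x), len(y)
    n = n_x + n_y
    d = x.shape[1]
    z = np.vstack([x, y])
    # Likelihood-ratio term: one Gaussian for z versus one each for x and y.
    ratio = 0.5 * (n * _logdet(_ml_cov(z))
                   - n_x * _logdet(_ml_cov(x))
                   - n_y * _logdet(_ml_cov(y)))
    # Penalty for the extra mean and covariance parameters of the second model.
    penalty = 0.5 * (d + 0.5 * d * (d + 1)) * np.log(n)
    return ratio - penalty_weight * penalty


def symmetric_kld(x, y):
    """Symmetric KL divergence, KL(x||y) + KL(y||x), between Gaussian fits."""
    d = x.shape[1]
    diff = x.mean(axis=0) - y.mean(axis=0)
    cov_x, cov_y = _ml_cov(x), _ml_cov(y)
    inv_x, inv_y = np.linalg.inv(cov_x), np.linalg.inv(cov_y)
    # The log-determinant terms cancel in the symmetric sum.
    return 0.5 * (np.trace(inv_y @ cov_x) + np.trace(inv_x @ cov_y)
                  - 2 * d + diff @ (inv_x + inv_y) @ diff)
```

In a detector built this way, the two functions would be evaluated at candidate boundaries between sliding windows, once on features from speech and once on features from ZFFS. How the paper combines the two sets of evidence is not stated in the abstract, so that step is left out here.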
Keywords :
Bayes methods; acoustic signal processing; cepstral analysis; speech processing; Bayesian Information Criteria; Kullback Leibler Divergence; Miss Detection Rate; ZFFS signal; cepstral features; excitation source information; noisy speech signal; speaker change detection; vocal tract system; zero frequency filtered speech signal; Databases; Feature extraction; Mel frequency cepstral coefficient; Noise measurement; Speech;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communications (NCC), 2015 Twenty First National Conference on
Conference_Location :
Mumbai
Type :
conf
DOI :
10.1109/NCC.2015.7084869
Filename :
7084869