Title :
A Comparable Study on PNCC in Speaker Diarization for Meetings
Author :
Li, Qiao ; Fan, Qing ; Xiao, Yunpeng ; Ye, Weiping
Author_Institution :
Coll. of Inf. Sci. & Technol., Beijing Normal Univ., Beijing, China
Abstract :
In speaker diarization, the most commonly used speaker feature is MFCC, which is also the most commonly used speech feature in speech recognition. The recently proposed Power Normalized Cepstrum Coefficients (PNCC) achieve an impressive improvement over MFCC in noisy speech recognition, so their usefulness for speaker diarization deserves to be tested. In this paper, PNCC is evaluated against MFCC in a meeting-domain speaker diarization system. The Diarization Error Rate (DER) shows no improvement with PNCC. A possible reason is that PNCC suppresses the high-frequency spectrum, which is believed to represent the characteristics of the human voice.
Keywords :
speaker recognition; PNCC; diarization error rate; meeting domain speaker diarization system; power normalized cepstrum coefficients; speech feature; Density estimation robust algorithm; Hidden Markov models; Materials; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; DER; MFCC; speaker diarization
Conference_Title :
Cryptography and Network Security, Data Mining and Knowledge Discovery, E-Commerce & Its Applications and Embedded Systems (CDEE), 2010 First ACIS International Symposium on
Conference_Location :
Qinhuangdao
Print_ISBN :
978-1-4244-9595-5
DOI :
10.1109/CDEE.2010.40