Title :
A Combined Features Approach for Speaker Segmentation Using BIC and Artificial Neural Networks
Author :
Valeriano Neri, Leonardo ; Tsang Ing Ren ; Cavalcanti, G.D.C. ; Tsang Ing Jyh ; Sijbers, J.
Author_Institution :
Centro de Inf. (CIn), Univ. Fed. de Pernambuco (UFPE), Recife, Brazil
Abstract :
We present a combined features approach for speaker segmentation task. This approach utilizes different acoustic features extracted from audio stream. The Bayesian Information Criterion (BIC) is used for each acoustic feature as a distance measure to verify the merging of two audio segments. An Artificial Neural Network (ANN) combines the time index from each ?BIC with the highest value, and estimates the change point. In the experiments, a data set containing examples with several speakers is used to compare our approach with the Chen and Gopalakrishnan´s window-growing-based approach, using different acoustic features sets. The results show an improvement in both the Miss Detection Rate (MDR) and the False Alarm Rate (FAR) compared to the window-growing-based approach.
Keywords :
Bayes methods; acoustic signal processing; audio signal processing; feature extraction; learning (artificial intelligence); neural nets; speaker recognition; ΔBIC; ANN; Bayesian information criterion; FAR; MDR; acoustic feature extraction; acoustic feature sets; artificial neural networks; audio segment merging; audio stream; change point estimation; combined feature approach; distance measure; false alarm rate; miss detection rate; speaker segmentation task; time index; window-growing-based approach; Acoustics; Artificial neural networks; Detection algorithms; Feature extraction; Speech; Speech processing; Training; Bayesian information criterion; artificial neural networks; combined features; speaker segmentation;
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2013 IEEE International Conference on
Conference_Location :
Manchester
DOI :
10.1109/SMC.2013.739