Title :
Voice Activity Detection based on Combination of Multiple Features using Linear/Kernel Discriminant Analyses
Author :
Soleimani, S.A. ; Ahadi, S.M.
Author_Institution :
Electr. Eng. Dept., Amirkabir Univ. of Tehran, Tehran
Abstract :
This paper presents a voice activity detection (VAD) scheme that uses multiple of some popular features. As, in each noisy condition, one type of feature performs best in speech/non- speech classification, combining features can lead to a better performance. Features are combined linearly with weights that were obtained for each condition in training stage via a method of classification and dimension reduction, i.e. linear discriminant analysis (LDA). Also, nonlinear combination of features is carried out via kernel discriminant analysis (KDA). The results are compared to MCE-based approach. We show that LDA and KDA can lead to better performance, compared to MCE and also the best single feature. In particular, KDA shows 12.14% improvement in average speech/non-speech classification rate, relative to the best feature.
Keywords :
speech recognition; dimension reduction; kernel discriminant analyses; linear discriminant analyses; minimum classification error; nonspeech classification; speech classification; voice activity detection; Entropy; Kernel; Linear discriminant analysis; Noise robustness; Signal processing; Speech analysis; Speech enhancement; Speech processing; Speech recognition; Kernel Discriminant Analysis; Linear Discriminant Analysis; Voice Activity Detection; entropy and MCE;
Conference_Titel :
Information and Communication Technologies: From Theory to Applications, 2008. ICTTA 2008. 3rd International Conference on
Conference_Location :
Damascus
Print_ISBN :
978-1-4244-1751-3
Electronic_ISBN :
978-1-4244-1752-0
DOI :
10.1109/ICTTA.2008.4530028