Title :
Gaussian process regression for voice activity detection and speech enhancement
Author :
Park, Sunho ; Choi, Seungjin
Author_Institution :
Dept. of Comput. Sci., Pohang Univ. of Sci. & Technol., Pohang
Abstract :
Gaussian process (GP) model is a flexible nonparametric Bayesian method that is widely used in regression and classification. In this paper we present a probabilistic method where we solve voice activity detection (VAD) and speech enhancement in a single framework of GP regression, modeling clean speech by a GP smoother. Optimized hyperparameters in GP models lead us to a novel VAD method since learned length-scale parameters in covariance functions are much different between voiced and unvoiced frames. Clean speech is estimated by posterior means in GP models. Numerical experiments confirm the validity of our method.
Keywords :
Bayes methods; Gaussian processes; nonparametric statistics; probability; regression analysis; signal detection; speech enhancement; Gaussian process regression; covariance functions; nonparametric Bayesian method; optimized hyperparameters; probabilistic method; signal classification; speech enhancement; voice activity detection; Bayesian methods; Gaussian noise; Gaussian processes; Kernel; Random processes; Signal processing; Speech enhancement; Speech processing; Wiener filter; Working environment noise;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4634203