DocumentCode
2960959
Title
Gaussian process regression for voice activity detection and speech enhancement
Author
Park, Sunho ; Choi, Seungjin
Author_Institution
Dept. of Comput. Sci., Pohang Univ. of Sci. & Technol., Pohang
fYear
2008
fDate
1-8 June 2008
Firstpage
2879
Lastpage
2882
Abstract
Gaussian process (GP) model is a flexible nonparametric Bayesian method that is widely used in regression and classification. In this paper we present a probabilistic method where we solve voice activity detection (VAD) and speech enhancement in a single framework of GP regression, modeling clean speech by a GP smoother. Optimized hyperparameters in GP models lead us to a novel VAD method since learned length-scale parameters in covariance functions are much different between voiced and unvoiced frames. Clean speech is estimated by posterior means in GP models. Numerical experiments confirm the validity of our method.
Keywords
Bayes methods; Gaussian processes; nonparametric statistics; probability; regression analysis; signal detection; speech enhancement; Gaussian process regression; covariance functions; nonparametric Bayesian method; optimized hyperparameters; probabilistic method; signal classification; speech enhancement; voice activity detection; Bayesian methods; Gaussian noise; Gaussian processes; Kernel; Random processes; Signal processing; Speech enhancement; Speech processing; Wiener filter; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location
Hong Kong
ISSN
1098-7576
Print_ISBN
978-1-4244-1820-6
Electronic_ISBN
1098-7576
Type
conf
DOI
10.1109/IJCNN.2008.4634203
Filename
4634203
Link To Document