DocumentCode :
2960959
Title :
Gaussian process regression for voice activity detection and speech enhancement
Author :
Park, Sunho ; Choi, Seungjin
Author_Institution :
Dept. of Comput. Sci., Pohang Univ. of Sci. & Technol., Pohang
fYear :
2008
fDate :
1-8 June 2008
Firstpage :
2879
Lastpage :
2882
Abstract :
Gaussian process (GP) model is a flexible nonparametric Bayesian method that is widely used in regression and classification. In this paper we present a probabilistic method where we solve voice activity detection (VAD) and speech enhancement in a single framework of GP regression, modeling clean speech by a GP smoother. Optimized hyperparameters in GP models lead us to a novel VAD method since learned length-scale parameters in covariance functions are much different between voiced and unvoiced frames. Clean speech is estimated by posterior means in GP models. Numerical experiments confirm the validity of our method.
Keywords :
Bayes methods; Gaussian processes; nonparametric statistics; probability; regression analysis; signal detection; speech enhancement; Gaussian process regression; covariance functions; nonparametric Bayesian method; optimized hyperparameters; probabilistic method; signal classification; speech enhancement; voice activity detection; Bayesian methods; Gaussian noise; Gaussian processes; Kernel; Random processes; Signal processing; Speech enhancement; Speech processing; Wiener filter; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
ISSN :
1098-7576
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
Type :
conf
DOI :
10.1109/IJCNN.2008.4634203
Filename :
4634203
Link To Document :
بازگشت