• DocumentCode
    2960959
  • Title

    Gaussian process regression for voice activity detection and speech enhancement

  • Author

    Park, Sunho ; Choi, Seungjin

  • Author_Institution
    Dept. of Comput. Sci., Pohang Univ. of Sci. & Technol., Pohang
  • fYear
    2008
  • fDate
    1-8 June 2008
  • Firstpage
    2879
  • Lastpage
    2882
  • Abstract
    Gaussian process (GP) model is a flexible nonparametric Bayesian method that is widely used in regression and classification. In this paper we present a probabilistic method where we solve voice activity detection (VAD) and speech enhancement in a single framework of GP regression, modeling clean speech by a GP smoother. Optimized hyperparameters in GP models lead us to a novel VAD method since learned length-scale parameters in covariance functions are much different between voiced and unvoiced frames. Clean speech is estimated by posterior means in GP models. Numerical experiments confirm the validity of our method.
  • Keywords
    Bayes methods; Gaussian processes; nonparametric statistics; probability; regression analysis; signal detection; speech enhancement; Gaussian process regression; covariance functions; nonparametric Bayesian method; optimized hyperparameters; probabilistic method; signal classification; speech enhancement; voice activity detection; Bayesian methods; Gaussian noise; Gaussian processes; Kernel; Random processes; Signal processing; Speech enhancement; Speech processing; Wiener filter; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-1820-6
  • Electronic_ISBN
    1098-7576
  • Type

    conf

  • DOI
    10.1109/IJCNN.2008.4634203
  • Filename
    4634203