• DocumentCode
    1987970
  • Title

    Subband Energy distance measure applied in multi-pass speech/non-speech discrimination

  • Author

    Chu, Wei ; Liu, Jia

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing
  • fYear
    2007
  • fDate
    12-15 Feb. 2007
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper proposes a novel Subband Energy (SBE) distance measure to describe the differences between heterogeneous segments, and applies it in multi-pass speech/non-speech discrimination. The first pass of the discrimination is a segmentation stage based on Bayesian Information Criterion (BIC). The second pass is a classification stage employing a Gaussian Mixture Model (GMM) classifier. The third pass is a post-processing procedure which is efficient in acquiring precise boundaries between heterogeneous segments using SBE distance measure. A front-end speech/non-speech discriminator is built to extract speech segments from the broadcast news data and provide these speech segments as input for the subsequent module. Experiments conducted on the National Broadcast News corpus have proved the feasibility and effectiveness of our method. The overall frame misclassification rate is controlled below 0.8%.
  • Keywords
    Bayes methods; Gaussian processes; information theory; pattern classification; speech processing; Bayesian information criterion; Gaussian mixture model classifier; heterogeneous segments; multipass nonspeech discrimination; multipass speech discrimination; subband energy distance measure; Acoustical engineering; Bayesian methods; Broadcasting; Data mining; Energy measurement; Hidden Markov models; Length measurement; Maximum likelihood detection; Maximum likelihood estimation; Speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Its Applications, 2007. ISSPA 2007. 9th International Symposium on
  • Conference_Location
    Sharjah
  • Print_ISBN
    978-1-4244-0778-1
  • Electronic_ISBN
    978-1-4244-1779-8
  • Type

    conf

  • DOI
    10.1109/ISSPA.2007.4555466
  • Filename
    4555466