• DocumentCode
    2148302
  • Title

    A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics

  • Author

    Mysore, Gautham J. ; Smaragdis, Paris

  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    17
  • Lastpage
    20
  • Abstract
    We present a semi-supervised source separation methodology to denoise speech by modeling speech as one source and noise as the other source. We model speech using the recently pro posed non-negative hidden Markov model, which uses multiple non-negative dictionaries and a Markov chain to jointly model spectral structure and temporal dynamics of speech. We perform separation of the speech and noise using the recently proposed non-negative factorial hidden Markov model. Although the speech model is learned from training data, the noise model is learned during the separation process and re quires no training data. We show that the proposed method achieves superior results to using non-negative spectrogram factorization, which ignores the non-stationarity and temporal dynamics of speech.
  • Keywords
    hidden Markov models; signal denoising; source separation; speech processing; Markov chain; multiple nonnegative dictionaries; nonnegative approach; nonnegative factorial hidden Markov model; nonnegative hidden Markov model; semisupervised separation; semisupervised source separation; spectral structure; speech denoising; speech modeling; temporal dynamics; Dictionaries; Hidden Markov models; Noise; Noise reduction; Source separation; Spectrogram; Speech; Denoising; Semi-supervised source separation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946317
  • Filename
    5946317