• DocumentCode
    2017731
  • Title

    Spectral trajectory estimation using nonnegative matrix factorization for model-based monaural speech separation

  • Author

    Mak, Chun-Man ; Lee, Tan ; Lee, S.W.

  • Author_Institution
    Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    23
  • Lastpage
    28
  • Abstract
    This paper presents a study on model-based speech separation for monaural speech mixture. With prior knowledge about of the text content of the speech sources, we estimate the spectral envelope trajectory of each target source and use them to filter the mixture signal so that the target signal is enhanced and the interfering signal is suppressed. Accurate trajectory estimation is therefore crucial for successful separation. We proposed to use the nonnegative matrix factorization in the trajectory estimation process which improves the accuracy of the estimated trajectories considerably. Performance evaluation is carried out using mixtures of two equally-loud Cantonese speech sources. The proposed method is found to have significant improvement over previously proposed speech separation methods.
  • Keywords
    matrix decomposition; performance evaluation; signal processing; speech processing; Cantonese speech source; interfering signal; mixture signal; model based monaural speech separation; nonnegative matrix factorization; performance evaluation; spectral envelope trajectory; spectral trajectory estimation; speech source; text content; Estimation; Hidden Markov models; Mel frequency cepstral coefficient; Signal to noise ratio; Speech; Trajectory; Speech separation; component; nonnegative matrix factorization; spectral envelope trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684883
  • Filename
    5684883