• DocumentCode
    2148222
  • Title

    Combining HMM-based melody extraction and NMF-based soft masking for separating voice and accompaniment from monaural audio

  • Author

    Wang, Yun ; Ou, Zhijian

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Modern monaural voice and accompaniment separation systems usually consist of two main modules: melody extraction and time frequency masking. A main distinction between different separation systems lies in what approaches are used for the two modules. Popular techniques for melody extraction include hidden Markov models (HMMs) and non-negative matrix factorization (NMF), and masking includes hard and soft masking. This paper investigates the flaw of NMF-based melody extraction, and proposes the combination of HMM-based melody extraction (equipped with a newly-defined feature) and NMF-based soft masking. Evaluations on two publicly available databases show that the proposed system reaches state-of the-art performance and outperforms several other combinations.
  • Keywords
    audio signal processing; hidden Markov models; matrix decomposition; source separation; HMM-based melody extraction; NMF-based melody extraction; NMF-based soft masking; hard masking; hidden Markov models; monaural audio; monaural voice and accompaniment separation systems; nonnegative matrix factorization; time frequency masking; voice accompaniment; voice separation; Accuracy; Databases; Feature extraction; Hidden Markov models; Spectrogram; Time frequency analysis; Viterbi algorithm; HMMs; Monaural sound separation; NMF; melody extraction; soft masking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946313
  • Filename
    5946313