• DocumentCode
    1135727
  • Title

    Monaural Musical Sound Separation Based on Pitch and Common Amplitude Modulation

  • Author

    Li, Yipeng ; Woodruff, John ; Wang, DeLiang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • Volume
    17
  • Issue
    7
  • fYear
    2009
  • Firstpage
    1361
  • Lastpage
    1371
  • Abstract
    Monaural musical sound separation has been extensively studied recently. An important problem in separation of pitched musical sounds is the estimation of time-frequency regions where harmonics overlap. In this paper, we propose a sinusoidal modeling-based separation system that can effectively resolve overlapping harmonics. Our strategy is based on the observations that harmonics of the same source have correlated amplitude envelopes and that the change in phase of a harmonic is related to the instrument's pitch. We use these two observations in a least squares estimation framework for separation of overlapping harmonics. The system directly distributes mixture energy for harmonics that are unobstructed by other sources. Quantitative evaluation of the proposed system is shown when ground truth pitch information is available, when rough pitch estimates are provided in the form of a MIDI score, and finally, when a multipitch tracking algorithm is used. We also introduce a technique to improve the accuracy of rough pitch estimates. Results show that the proposed system significantly outperforms related monaural musical sound separation systems.
  • Keywords
    amplitude modulation; audio signal processing; frequency estimation; harmonic analysis; least squares approximations; music; source separation; time-frequency analysis; tracking; MIDI score; common amplitude modulation; harmonics overlap separation; least squares estimation framework; monaural musical sound separation; multi pitch tracking algorithm; pitch amplitude modulation; sinusoidal modeling; time-frequency region estimation; Amplitude modulation; Computer science; Image analysis; Independent component analysis; Instruments; Least squares approximation; Music information retrieval; Psychoacoustic models; Signal processing algorithms; Sparse matrices; Common amplitude modulation (CAM); musical sound separation; sinusoidal modeling; time–frequency masking; underdetermined sound separation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2020886
  • Filename
    5165119