DocumentCode
1135727
Title
Monaural Musical Sound Separation Based on Pitch and Common Amplitude Modulation
Author
Li, Yipeng ; Woodruff, John ; Wang, DeLiang
Author_Institution
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Volume
17
Issue
7
fYear
2009
Firstpage
1361
Lastpage
1371
Abstract
Monaural musical sound separation has been extensively studied recently. An important problem in separation of pitched musical sounds is the estimation of time-frequency regions where harmonics overlap. In this paper, we propose a sinusoidal modeling-based separation system that can effectively resolve overlapping harmonics. Our strategy is based on the observations that harmonics of the same source have correlated amplitude envelopes and that the change in phase of a harmonic is related to the instrument´s pitch. We use these two observations in a least squares estimation framework for separation of overlapping harmonics. The system directly distributes mixture energy for harmonics that are unobstructed by other sources. Quantitative evaluation of the proposed system is shown when ground truth pitch information is available, when rough pitch estimates are provided in the form of a MIDI score, and finally, when a multi pitch tracking algorithm is used. We also introduce a technique to improve the accuracy of rough pitch estimates. Results show that the proposed system significantly outperforms related monaural musical sound separation systems.
Keywords
amplitude modulation; audio signal processing; frequency estimation; harmonic analysis; least squares approximations; music; source separation; time-frequency analysis; tracking; MIDI score; common amplitude modulation; harmonics overlap separation; least squares estimation framework; monaural musical sound separation; multi pitch tracking algorithm; pitch amplitude modulation; sinusoidal modeling; time-frequency region estimation; Amplitude modulation; Computer science; Image analysis; Independent component analysis; Instruments; Least squares approximation; Music information retrieval; Psychoacoustic models; Signal processing algorithms; Sparse matrices; Common amplitude modulation (CAM); musical sound separation; sinusoidal modeling; time–frequency masking; underdetermined sound separation;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2009.2020886
Filename
5165119
Link To Document