DocumentCode
1497083
Title
Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation
Author
Vincent, Emmanuel ; Bertin, Nancy ; Badeau, Roland
Author_Institution
INRIA, Centre Inria Rennes-Bretagne Atlantique, Rennes, France
Volume
18
Issue
3
fYear
2010
fDate
3/1/2010 12:00:00 AM
Firstpage
528
Lastpage
537
Abstract
Multiple pitch estimation consists of estimating the fundamental frequencies and saliences of pitched sounds over short time frames of an audio signal. This task forms the basis of several applications in the particular context of musical audio. One approach is to decompose the short-term magnitude spectrum of the signal into a sum of basis spectra representing individual pitches scaled by time-varying amplitudes, using algorithms such as nonnegative matrix factorization (NMF). Prior training of the basis spectra is often infeasible due to the wide range of possible musical instruments. Appropriate spectra must then be adaptively estimated from the data, which may result in limited performance due to overfitting issues. In this paper, we model each basis spectrum as a weighted sum of narrowband spectra representing a few adjacent harmonic partials, thus enforcing harmonicity and spectral smoothness while adapting the spectral envelope to each instrument. We derive an NMF-like algorithm to estimate the model parameters and evaluate it on a database of piano recordings, considering several choices for the narrowband spectra. The proposed algorithm performs similarly to supervised NMF using pre-trained piano spectra but improves pitch estimation performance by 6% to 10% compared to alternative unsupervised NMF algorithms.
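The decomposition mentioned in the abstract, a magnitude spectrogram approximated as basis spectra scaled by time-varying amplitudes, can be illustrated with a minimal sketch of plain unsupervised NMF. This is not the harmonically constrained algorithm proposed in the paper: it uses the standard multiplicative updates for the Kullback-Leibler divergence, and the names `nmf_kl`, `kl_divergence`, `V`, `W`, and `H` are assumptions chosen for illustration.

```python
import numpy as np


def kl_divergence(V, WH, eps=1e-12):
    """Generalized Kullback-Leibler divergence between V and its model WH."""
    return float(np.sum(V * np.log((V + eps) / (WH + eps)) - V + WH))


def nmf_kl(V, n_basis, n_iter=200, eps=1e-12, seed=0):
    """Factor a nonnegative matrix V (frequency x time) as W @ H.

    W holds one basis spectrum per column, H the time-varying amplitudes.
    Plain unsupervised NMF with multiplicative updates; no harmonicity or
    smoothness constraint is imposed on the basis spectra.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, n_basis)) + eps
    H = rng.random((n_basis, n_frames)) + eps
    for _ in range(n_iter):
        # Update amplitudes with the basis spectra held fixed.
        WH = W @ H + eps
        H *= (W.T @ (V / WH)) / (W.sum(axis=0)[:, None] + eps)
        # Update basis spectra with the amplitudes held fixed.
        WH = W @ H + eps
        W *= ((V / WH) @ H.T) / (H.sum(axis=1)[None, :] + eps)
    return W, H


if __name__ == "__main__":
    # Synthetic stand-in for a magnitude spectrogram: 3 sources, 64 bins, 40 frames.
    rng = np.random.default_rng(1)
    V = rng.random((64, 3)) @ rng.random((3, 40))
    W, H = nmf_kl(V, n_basis=3)
    print("KL divergence after fitting:", kl_divergence(V, W @ H))
```

In a pitch estimation setting, each column of `W` would ideally correspond to one pitch and the matching row of `H` to its salience over time; the constraint described in the abstract restricts each basis spectrum to a weighted sum of fixed narrowband harmonic spectra, which this sketch does not do.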
Keywords
audio signal processing; matrix decomposition; adaptive harmonic spectral decomposition; audio signal; multiple pitch estimation; musical instruments; nonnegative matrix factorization; pitch estimation; short-term magnitude spectrum; Adaptive representation; harmonicity; multiple pitch estimation; nonnegative matrix factorization; spectral smoothness;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2009.2034186
Filename
5282583