Title :
Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation
Author :
Lee, Cheng-Te ; Yang, Yi-Hsuan ; Chen, Homer H.
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fDate :
6/1/2012 12:00:00 AM
Abstract :
Pitch, together with other midlevel music features such as rhythm and timbre, holds the promise of bridging the semantic gap between low-level features and high-level semantics for music understanding. This paper investigates the pitch estimation of a piano music signal by exemplar-based sparse representation. A note exemplar is a segment of a piano note, stored in the dictionary. We first describe how to represent a segment of the piano music signal as a linear combination of a small number of note exemplars from a large note exemplar dictionary and then show how the sparse representation problem can be solved by -regularized minimization. The proposed approach incorporates tuning factor estimation, note candidate selection, and hidden-Markov-model-based smoothing into the estimation process to improve accuracy. Unlike previous approaches, the proposed approach does not require retraining for a new piano. Instead, only a dozen notes of the new piano are needed. This feature is computationally attractive and avoids intense manual labeling. The system performance is evaluated using 70 classical music recordings of two real pianos under different recording conditions. The results show that the proposed system outperforms four state-of-the-art systems.
Keywords :
acoustic signal processing; content-based retrieval; hidden Markov models; music; musical instruments; sparse matrices; exemplar dictionary; exemplar-based sparse representation; hidden Markov model-based smoothing; high-level semantics; l1-regularized minimization; linear combination; low-level features; multipitch estimation; note candidate selection; piano music signal; piano note; recording conditions; system performance; tuning factor estimation; Accuracy; Dictionaries; Estimation; Harmonic analysis; Instruments; Multiple signal classification; Music; $l_{1}$-regularized minimization; Content retrieval; music transcription; pitch estimation; sparse representation;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2012.2191398