DocumentCode
3404049
Title
Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of multi-instrument polyphonic music signals
Author
Miyamoto, Kenichi ; Kameoka, Hirokazu ; Nishimoto, Takuya ; Ono, Nobutaka ; Sagayama, Shigeki
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
113
Lastpage
116
Abstract
In this paper, we discuss a new approach named Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of single-channel audio signals of multi-instrument polyphonic music. HTTC estimates the pitch, onset timing, power and duration of all the acoustic events and simultaneously classifies them into timbre categories. Each acoustic event is modeled by a harmonic structure and a smooth envelope, both represented by Gaussian mixtures. Based on the similarity between these spectro-temporal structures, timbres are clustered to form timbre categories. The entire process is mathematically formulated as a minimization problem for the I-divergence between the HTTC parametric model and the observed spectrogram of the music audio signal, in which the harmonic, temporal and timbral model parameters are updated simultaneously through the EM algorithm. Some experimental results are presented to discuss the performance of the algorithm.
Keywords
Gaussian processes; audio signal processing; harmonic analysis; music; signal classification; EM algorithm; Gaussian mixture; HTTC approach; harmonic-temporal-timbral clustering; multiinstrument polyphonic music signal; pitch estimation; single-channel audio signal analysis; spectro-temporal structure; spectrogram; Harmonic analysis; Minimization methods; Multiple signal classification; Music; Parametric statistics; Signal analysis; Signal processing; Spectrogram; Timbre; Timing; Harmonic-Temporal-Timbral Clustering (HTTC); analysis of multi-instrument music
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4517559
Filename
4517559