• DocumentCode
    3404049
  • Title

    Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of multi-instrument polyphonic music signals

  • Author

    Miyamoto, Kenichi ; Kameoka, Hirokazu ; Nishimoto, Takuya ; Ono, Nobutaka ; Sagayama, Shigeki

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    113
  • Lastpage
    116
  • Abstract
    In this paper, we discuss a new approach named Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of single- channel audio signal of multi-instrument polyphonic music to estimate the pitch, onset timing, power and duration of all the acoustic events and to classify them into timbre categories simultaneously. Each acoustic event is modeled by a harmonic structure and a smooth envelope both represented by Gaussian mixtures. Based on the similarity between these spectro- temporal structures, timbres are clustered to form timbre categories. The entire process is mathematically formulated as a minimization problem for the I-divergence between the HTTC parametric model and the observed spectrogram of the music audio signal to simultaneously update harmonic, temporal and timbral model parameters through the EM algorithm. Some experimental results are presented to discuss the performance of the algorithm.
  • Keywords
    Gaussian processes; audio signal processing; harmonic analysis; music; signal classification; EM algorithm; Gaussian mixture; HTTC approach; harmonic-temporal-timbral clustering; multiinstrument polyphonic music signal; pitch estimation; single-channel audio signal analysis; spectro-temporal structure; spectrogram; Harmonic analysis; Minimization methods; Multiple signal classification; Music; Parametric statistics; Signal analysis; Signal processing; Spectrogram; Timbre; Timing; EM algorithm; Harmonic-Temporal-Timbral Clustering (HTTC); analysis of multi-instrument music;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4517559
  • Filename
    4517559