• DocumentCode
    1336048
  • Title

    Dynamic Spectral Envelope Modeling for Timbre Analysis of Musical Instrument Sounds

  • Author

    Burred, Juan José ; Röbel, Axel ; Sikora, Thomas

  • Author_Institution
    Anal./Synthesis Team, IRCAM, Paris, France
  • Volume
    18
  • Issue
    3
  • fYear
    2010
  • fDate
    3/1/2010 12:00:00 AM
  • Firstpage
    663
  • Lastpage
    674
  • Abstract
    We present a computational model of musical instrument sounds that focuses on capturing the dynamic behavior of the spectral envelope. A set of spectro-temporal envelopes belonging to different notes of each instrument are extracted by means of sinusoidal modeling and subsequent frequency interpolation, before being subjected to principal component analysis. The prototypical evolution of the envelopes in the obtained reduced-dimensional space is modeled as a nonstationary Gaussian Process. This results in a compact representation in the form of a set of prototype curves in feature space, or equivalently of prototype spectro-temporal envelopes in the time-frequency domain. Finally, the obtained models are successfully evaluated in the context of two music content analysis tasks: classification of instrument samples and detection of instruments in monaural polyphonic mixtures.
  • Keywords
    Gaussian processes; acoustic signal processing; feature extraction; interpolation; musical acoustics; musical instruments; principal component analysis; spectral analysis; computational model; dynamic spectral envelope modeling; envelope evolution; feature space; instrument detection; instrument sample classification; monaural polyphonic mixture; music content analysis; musical instrument sounds; nonstationary Gaussian process; principal component analysis; reduced-dimensional space; sinusoidal modeling; spectro-temporal envelope; subsequent frequency interpolation; timbre analysis; time-frequency domain; Gaussian processes; music information retrieval (MIR); sinusoidal modeling; spectral envelope; timbre model;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2036300
  • Filename
    5337963