• DocumentCode
    180340
  • Title
    A novel cepstral representation for timbre modeling of sound sources in polyphonic mixtures
  • Author
    Duan, Zhiyao ; Pardo, Bryan ; Daudet, Laurent
  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Rochester, Rochester, NY, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    7495
  • Lastpage
    7499
  • Abstract
    We propose a novel cepstral representation called the uniform discrete cepstrum (UDC) to represent the timbre of sound sources in a sound mixture. Unlike the ordinary cepstrum and MFCC, which have to be calculated from the full magnitude spectrum of a source after source separation, UDC can be calculated directly from isolated spectral points that are likely to belong to the source in the mixture spectrum (e.g., non-overlapping harmonics of a harmonic source). Existing cepstral representations that have this property are the discrete cepstrum and the regularized discrete cepstrum; however, compared to the proposed UDC, they are less effective and more complex to compute. The key advantage of UDC is that it uses a more natural and locally adaptive regularizer to prevent overfitting to the isolated spectral points. We derive the mathematical relations between these cepstral representations and compare their timbre modeling performance in the task of instrument recognition in polyphonic audio mixtures. We show that UDC and its mel-scale variant MUDC significantly outperform all the other representations.
  • Keywords
    acoustic generators; acoustic radiators; audio signal processing; cepstral analysis; signal representation; source separation; adaptive regularizer; cepstral representation; instrument recognition; isolated spectral points; magnitude spectrum; mel-scale variant MUDC; mixture spectrum; polyphonic audio mixtures; regularized discrete cepstrum; sound mixture; sound sources; source separation; timbre modeling; uniform discrete cepstrum; Cepstrum; Harmonic analysis; Instruments; Mel frequency cepstral coefficient; Source separation; Timbre; Cepstrum; instrument recognition; polyphonic; timbre
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type
    conf
  • DOI
    10.1109/ICASSP.2014.6855057
  • Filename
    6855057