DocumentCode :
180340
Title :
A novel cepstral representation for timbre modeling of sound sources in polyphonic mixtures
Author :
Zhiyao Duan ; Pardo, Bryan ; Daudet, Laurent
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Rochester, Rochester, NY, USA
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
7495
Lastpage :
7499
Abstract :
We propose a novel cepstral representation called the uniform discrete cepstrum (UDC) to represent the timbre of sound sources in a sound mixture. Different from ordinary cepstrum and MFCC which have to be calculated from the full magnitude spectrum of a source after source separation, UDC can be calculated directly from isolated spectral points that are likely to belong to the source in the mixture spectrum (e.g., non-overlapping harmonics of a harmonic source). Existing cepstral representations that have this property are discrete cepstrum and regularized discrete cepstrum, however, compared to the proposed UDC, they are not as effective and are more complex to compute. The key advantage of UDC is that it uses a more natural and locally adaptive regularizer to prevent it from overfitting the isolated spectral points. We derive the mathematical relations between these cepstral representations, and compare their timbre modeling performances in the task of instrument recognition in polyphonic audio mixtures. We show that UDC and its mel-scale variant MUDC significantly outperform all the other representations.
Keywords :
acoustic generators; acoustic radiators; audio signal processing; cepstral analysis; signal representation; source separation; adaptive regularizer; cepstral representation; instrument recognition; isolated spectral points; magnitude spectrum; mel-scale variant MUDC; mixture spectrum; polyphonic audio mixtures; regularized discrete cepstrum; sound mixture; sound sources; source separation; timbre modeling; uniform discrete cepstrum; Cepstrum; Harmonic analysis; Instruments; Mel frequency cepstral coefficient; Source separation; Timbre; Cepstrum; instrument recognition; polyphonic; timbre;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6855057
Filename :
6855057
Link To Document :
بازگشت