DocumentCode :
667512
Title :
Perceptual Cepstral filters for speech and music processing
Author :
Mignot, Remi ; Valimaki, Vesa
Author_Institution :
Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
fYear :
2013
fDate :
20-23 Oct. 2013
Firstpage :
1
Lastpage :
4
Abstract :
Source-filter modeling of speech or musical tones requires a filter model for the spectral envelope of the signal. To reduce the number of modeling parameters, one idea is the use of psychoacoustic knowledge to encode only the relevant information in a perceptual sense. Starting from an accurate estimation of the original spectral envelope, with imperceptible details, in this work, we propose to use its Mel-Frequency Cepstral Coefficient (MFCC) representation to catch the perceptually relevant information. Then, a new inverse process is presented to derive a smoother, but perceptually equivalent spectral envelope. For instance, this new method can be applied in speech coding, and thanks to the good properties of the MFCC representation, perceptual interpolations of sounds is made easier.
Keywords :
cepstral analysis; music; signal representation; speech coding; speech processing; MFCC representation; inverse process; mel-frequency cepstral coefficient representation; music processing; musical tones; perceptual cepstral filters; perceptual interpolations; psychoacoustic knowledge; speech coding; speech processing; speech source-filter modeling; Estimation; Mel frequency cepstral coefficient; Signal processing; Speech; Speech recognition; Timbre; Auditory perception; Cepstral representation; MFCC; Sound synthesis; Source-filter principle; Spectral envelope modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Type :
conf
DOI :
10.1109/WASPAA.2013.6701858
Filename :
6701858
Link To Document :
بازگشت