DocumentCode
698357
Title
Warped discrete cosine transform cepstrum: A new feature for speech processing
Author
Muralishankar, R. ; Sangwan, Abhijeet ; O´Shaughnessy, Douglas
Author_Institution
INRS-EMT (Telecommun.), Univ. of Quebec, Montreal, QC, Canada
fYear
2005
fDate
4-8 Sept. 2005
Firstpage
1
Lastpage
4
Abstract
In this paper, we propose a new feature for speech recognition and speaker identification application. The new feature is termed as warped-discrete cosine transform cepstrum (WDCTC). The feature is obtained by replacing the discrete cosine transform (DCT) by the warped discrete cosine transform (WDCT, [4]) in the discrete cosine tranform cepstrum (DCTC [2]). The WDCT is implemented as a cascade of the DCT and IIR all-pass filters. We incorporate a nonlinear frequency-scale in DCTC which closely follows the bark-scale. This is accomplished by setting the all-pass filter parameter using an expression given by Smith and Abel [5]. Performance of WDCTC is compared to mel-frequency cepstral coefficients (MFCC) in a speech recognition and speaker identification experiment. WDCTC outperforms MFCC in both noisy and noiseless conditions.
Keywords
IIR filters; all-pass filters; discrete cosine transforms; speaker recognition; IIR all-pass filters; MFCC; WDCTC; bark-scale; mel-frequency cepstral coefficients; nonlinear frequency-scale; speaker identification; speech processing; speech recognition; warped discrete cosine transform cepstrum; Cepstrum; Discrete cosine transforms; Mel frequency cepstral coefficient; Noise; Speech; Speech processing; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2005 13th European
Conference_Location
Antalya
Print_ISBN
978-160-4238-21-1
Type
conf
Filename
7077941
Link To Document