DocumentCode
2066121
Title
Order Adaptation of the Fractional Fourier Transform Using the Intraframe Pitch Change Rate for Speech Recognition
Author
Yin, Hui ; Nadeu, Climent ; Hohmann, Volker ; Xie, Xiang ; Kuang, Jingming
Author_Institution
Dept. of Electron. Eng., Beijing Inst. of Technol., Beijing, China
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
We propose an acoustic feature for speech recognition based on the combination of MFCC and fractional Fourier transform (FrFT). The transform orders for FrFT are adaptively set according to the intraframe pitch change rate. This method is motivated by the fact that the speech is not stationary even in a short period of time, and the idea is shown using an AM-FM speech model and some spectrograms of an artificial periodic signal. Experiments were conducted on the intervocalic English consonants provided by Interspeech 2008 Consonant Challenge and a Mandarin connected digits corpus. The performance of the proposed method is compared with the MFCC baseline system. Experimental results show that the proposed features get a slightly better recognition rate than MFCCs presumably because they can better track the dynamic characteristics of the speech harmonics.
Keywords
Fourier transforms; speech recognition; AM-FM speech model; FrFT; MFCC baseline system; Mel-frequency cepstral coefficients; fractional Fourier transform; intervocalic English consonants; intraframe pitch change rate; order adaptation; speech recognition; Auditory system; Automatic speech recognition; Chirp; Feature extraction; Fourier transforms; Mel frequency cepstral coefficient; Speech analysis; Speech processing; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2942-4
Electronic_ISBN
978-1-4244-2943-1
Type
conf
DOI
10.1109/CHINSL.2008.ECP.60
Filename
4730314
Link To Document