• DocumentCode
    2066121
  • Title
    Order Adaptation of the Fractional Fourier Transform Using the Intraframe Pitch Change Rate for Speech Recognition
  • Author
    Yin, Hui ; Nadeu, Climent ; Hohmann, Volker ; Xie, Xiang ; Kuang, Jingming
  • Author_Institution
    Dept. of Electron. Eng., Beijing Inst. of Technol., Beijing, China
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    We propose an acoustic feature for speech recognition that combines Mel-frequency cepstral coefficients (MFCC) with the fractional Fourier transform (FrFT). The FrFT transform orders are set adaptively according to the intraframe pitch change rate. The method is motivated by the fact that speech is not stationary even over a short period of time; the idea is illustrated using an AM-FM speech model and spectrograms of an artificial periodic signal. Experiments were conducted on the intervocalic English consonants provided by the Interspeech 2008 Consonant Challenge and on a Mandarin connected-digits corpus, and the performance of the proposed method is compared with that of an MFCC baseline system. The results show that the proposed features achieve a slightly higher recognition rate than MFCCs, presumably because they better track the dynamic characteristics of the speech harmonics.
  • Keywords
    Fourier transforms; speech recognition; AM-FM speech model; FrFT; MFCC baseline system; Mel-frequency cepstral coefficients; fractional Fourier transform; intervocalic English consonants; intraframe pitch change rate; order adaptation; Auditory system; Automatic speech recognition; Chirp; Feature extraction; Mel frequency cepstral coefficient; Speech analysis; Speech processing; Speech synthesis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2942-4
  • Electronic_ISBN
    978-1-4244-2943-1
  • Type
    conf
  • DOI
    10.1109/CHINSL.2008.ECP.60
  • Filename
    4730314