• DocumentCode
    3239916
  • Title

    A Performance Analysis of Features from Complex Cepstra of Warped DST, DCT and DHT Filters for Phoneme Recognition

  • Author

    Muralishankar, R. ; Shankar, H.N. ; O'Shaughnessy, Douglas

  • Author_Institution
    PES Inst. of Technol., Bangalore
  • fYear
    2007
  • fDate
    1-4 July 2007
  • Firstpage
    591
  • Lastpage
    594
  • Abstract
    An analytical model has been developed for the warped discrete Hartley transform cepstrum (WDHTC) in a recent work. Along similar lines, the warped discrete cosine transform (WDCT) has since been modelled in a companion paper. These were preceded by empirical studies of the WDCT cepstrum (WDCTC) as applied to speech feature extraction for vowel recognition and speaker identification. In this paper, we derive the theoretical complex cepstrum (TCC) based on the warped discrete sine transform. We argue that the common recipe evolved through these papers may be used as a measure to compare analytically deducible front-end speech recognition schemes. In particular, we show that the WDCTC-based scheme outperforms the present warped discrete sine transform cepstrum (WDSTC)-based scheme and the one based on the warped discrete Hartley transform in terms of low variance of features due to reduced spectral dynamic range. The phoneme recognition performance of WDCTC, WDHTC and WDSTC corroborates our analytical findings.
  • Keywords
    cepstral analysis; discrete Hartley transforms; discrete cosine transforms; discrete transforms; feature extraction; filtering theory; speech recognition; complex cepstra; performance analysis; phoneme recognition; speaker identification; speech feature extraction; speech recognition; vowel recognition; warped DCT filters; warped DHT filters; warped DST filters; warped discrete Hartley transform cepstrum; warped discrete cosine transform cepstrum; warped discrete sine transform cepstrum; Analytical models; Cepstral analysis; Cepstrum; Discrete cosine transforms; Discrete transforms; Feature extraction; Filters; Performance analysis; Speech analysis; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Digital Signal Processing, 2007 15th International Conference on
  • Conference_Location
    Cardiff
  • Print_ISBN
    1-4244-0881-4
  • Electronic_ISBN
    1-4244-0882-2
  • Type

    conf

  • DOI
    10.1109/ICDSP.2007.4288651
  • Filename
    4288651