• DocumentCode
    2761081
  • Title

    Two novel FDLP based feature extraction methods for improvement of speech recognition

  • Author

    Shekofteh, Yasser ; Almasganj, Farshad ; Rezaei, Ahmadreza ; Goodarzi, Mohammad Mohsen

  • Author_Institution
    Biomed. Eng. Fac., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2010
  • fDate
    4-6 Dec. 2010
  • Firstpage
    600
  • Lastpage
    603
  • Abstract
    In conventional automatic speech recognition systems, linguistic information of the speech signal are usually acquired from short-time frames about 10-30 ms. In this paper we have proposed two novel methods extracting the long-term information of the speech signal. Both of the methods are based on “sub-band FDLP” which divides the long-time frame of signal into several sub-bands. Using the MFCC algorithm, we are able to represent the long-term temporal features of the each sub-band. Our results show that the proposed methods could improve the recognition ratio by %1.73. The proposed methods were evaluated using the FarsDat database and the method´s robustness against different conditions of noise was experimented.
  • Keywords
    feature extraction; speech recognition; FarsDat database; MFCC algorithm; feature extraction methods; linguistic information; speech recognition; subband FDLP; Discrete cosine transforms; Feature extraction; Mel frequency cepstral coefficient; Prediction algorithms; Signal to noise ratio; Speech; Speech recognition; Feature extraction; Linear predictive coding; Speech recognition; Time-frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications (IST), 2010 5th International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4244-8183-5
  • Type

    conf

  • DOI
    10.1109/ISTEL.2010.5734095
  • Filename
    5734095