Two novel FDLP based feature extraction methods for improvement of speech recognition

Author

Shekofteh, Yasser ; Almasganj, Farshad ; Rezaei, Ahmadreza ; Goodarzi, Mohammad Mohsen

Author_Institution

Biomed. Eng. Fac., Amirkabir Univ. of Technol., Tehran, Iran

fYear

2010

fDate

4-6 Dec. 2010

Firstpage

600

Lastpage

603

Abstract

In conventional automatic speech recognition systems, linguistic information of the speech signal are usually acquired from short-time frames about 10-30 ms. In this paper we have proposed two novel methods extracting the long-term information of the speech signal. Both of the methods are based on “sub-band FDLP” which divides the long-time frame of signal into several sub-bands. Using the MFCC algorithm, we are able to represent the long-term temporal features of the each sub-band. Our results show that the proposed methods could improve the recognition ratio by %1.73. The proposed methods were evaluated using the FarsDat database and the method´s robustness against different conditions of noise was experimented.

Keywords

feature extraction; speech recognition; FarsDat database; MFCC algorithm; feature extraction methods; linguistic information; speech recognition; subband FDLP; Discrete cosine transforms; Feature extraction; Mel frequency cepstral coefficient; Prediction algorithms; Signal to noise ratio; Speech; Speech recognition; Feature extraction; Linear predictive coding; Speech recognition; Time-frequency analysis;

fLanguage

English

Publisher

ieee

Conference_Titel

Telecommunications (IST), 2010 5th International Symposium on

Conference_Location

Tehran

Print_ISBN

978-1-4244-8183-5

Type

conf

DOI

10.1109/ISTEL.2010.5734095

Filename

5734095