DocumentCode :
1707306
Title :
Optimized linear discriminant analysis for extracting robust speech features
Author :
Abbasian, H. ; Nasersharif, B. ; Akbari, A. ; Rahmani, M. ; Moin, M.S.
Author_Institution :
Dept. of Comput. Eng., Iran Univ. of Sci. & Technol., Tehran
fYear :
2008
Firstpage :
819
Lastpage :
824
Abstract :
Linear discriminant analysis (LDA) is a feature selection method used in speech recognition. LDA finds a transformation that maximizes the between-class scatter and minimizes the within-class scatter. This transformation can be obtained in a class-dependent or class-independent manner. In this paper, we propose a method that uses class-dependent LDA for speech recognition and MFCC extraction. In addition, we propose a multidimensional genetic algorithm (MGA) to optimize the class-dependent LDA transformation matrix for robust MFCC extraction. For this purpose, we first use the logarithm of the clean-speech Mel filter bank energies (LMFE) of each class to define the within-class and between-class scatter. Next, we obtain the class-dependent LDA transformation matrix using the MGA and use this matrix in place of the DCT in MFCC feature extraction. The experimental results show that the proposed speech recognition and optimization methods using class-dependent LDA achieve a significant isolated-word recognition rate on the Aurora2 database.
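As a rough illustration of the scatter-based idea in the abstract, the following sketch computes within-class and between-class scatter matrices from log Mel filter bank energies and derives a projection to use where the DCT would normally sit. It is not the authors' method: it uses the ordinary closed-form, class-independent LDA solution instead of the class-dependent transformation optimized by the MGA, and the data are synthetic. The function names, the 23-channel filter bank, and the three classes are all assumptions made for the example.

```python
import numpy as np


def scatter_matrices(features, labels):
    """Return (S_w, S_b) for row-vector features and integer class labels."""
    overall_mean = features.mean(axis=0)
    dim = features.shape[1]
    s_w = np.zeros((dim, dim))
    s_b = np.zeros((dim, dim))
    for c in np.unique(labels):
        x_c = features[labels == c]
        mean_c = x_c.mean(axis=0)
        centered = x_c - mean_c
        # Within-class scatter: spread of each class around its own mean.
        s_w += centered.T @ centered
        # Between-class scatter: spread of class means around the overall mean,
        # weighted by the number of samples in the class.
        diff = (mean_c - overall_mean)[:, None]
        s_b += x_c.shape[0] * (diff @ diff.T)
    return s_w, s_b


def lda_transform(features, labels, n_components):
    """Closed-form, class-independent LDA projection (rows are basis vectors)."""
    s_w, s_b = scatter_matrices(features, labels)
    # Leading eigenvectors of pinv(S_w) @ S_b maximize between-class scatter
    # relative to within-class scatter.
    eigvals, eigvecs = np.linalg.eig(np.linalg.pinv(s_w) @ s_b)
    order = np.argsort(eigvals.real)[::-1][:n_components]
    return eigvecs[:, order].real.T


# Synthetic stand-in for LMFE frames: 3 classes, 23 Mel channels (assumed sizes).
rng = np.random.default_rng(0)
n_classes, n_per_class, n_channels = 3, 50, 23
class_means = rng.normal(size=(n_classes, n_channels))
lmfe = np.vstack(
    [m + 0.5 * rng.normal(size=(n_per_class, n_channels)) for m in class_means]
)
labels = np.repeat(np.arange(n_classes), n_per_class)

# With 3 classes, S_b has rank at most 2, so 2 components are meaningful.
W = lda_transform(lmfe, labels, n_components=2)

# Apply W where a DCT matrix would be applied in a conventional MFCC pipeline.
features_out = lmfe @ W.T
```

In a real front end, the rows of `lmfe` would be per-frame log filter bank energies labeled by class (for example, by word or phone), and `W` would take the place of the truncated DCT matrix.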
Keywords :
cepstral analysis; channel bank filters; feature extraction; genetic algorithms; matrix algebra; speech recognition; Aurora2 database; LMFE; MGA; Mel frequency cepstral coefficient; linear discriminant analysis; logarithm-Mel filter bank energy; multidimensional genetic algorithm; optimized LDA transformation matrix; robust MFCC feature extraction; Filter bank; Multidimensional systems; Robustness; Scattering; Speech analysis; Class-dependent; MFCC
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communications, Control and Signal Processing, 2008. ISCCSP 2008. 3rd International Symposium on
Conference_Location :
St Julians
Print_ISBN :
978-1-4244-1687-5
Electronic_ISBN :
978-1-4244-1688-2
Type :
conf
DOI :
10.1109/ISCCSP.2008.4537336
Filename :
4537336