Title :
Learning Vector Quantization and Neural Predictive Coding for nonlinear speech feature extraction
Author :
Chetouani, M. ; Gas, B. ; Zarader, J.L.
Author_Institution :
Lab. des Instrum. et Syst. d´Ile-De-France, Univ. Paris VI, Paris, France
Abstract :
Speech recognition is a special field of pattern recognition. In order to improve the performances of the systems, one can opt for several ways and among them the design of a feature extractor. This paper presents a new nonlinear feature extraction method based on the Learning Vector Quantization (LVQ) and the Neural Predictive Coding (NPC). The key idea of this work is to design a feature extractor, the NPC, by the introduction of discriminant constraint provided by the LVQ classifier. The performances are estimated on a phoneme classification task by several methods: GMM, MLP, LVQ. The phonemes are extracted from the NTIMIT database. We make comparisons with linear and nonlinear feature transformation methods (LDA, PCA, NLDA, NPCA), and also with coding methods (LPC, MFCC, PLP).
Keywords :
Gaussian processes; feature extraction; mixture models; signal classification; speech coding; vector quantisation; GMM; Gaussians mixture model; LVQ classifier; NTIMIT database; learning vector quantization; neural predictive coding; nonlinear feature transformation method; nonlinear speech feature extraction method; phoneme classification task; Abstracts; Adaptation models; Databases; Loss measurement; Mel frequency cepstral coefficient; Predictive models; Speech;
Conference_Titel :
Signal Processing Conference, 2004 12th European
Conference_Location :
Vienna
Print_ISBN :
978-320-0001-65-7