Title :
Speech recognition using temporal decomposition and multi-layer feed-forward automata
Author :
Montaci, Claude ; Choukri, Khalid ; Chollet, Gurard
Author_Institution :
Dept. Signal, ENST, Paris, France
Abstract :
A report is presented of intraspeaker and interspeaker variability as a major source of error in automatic speech recognition. The authors report on two series of experiments using multilayer feed-forward automata (MLFFA) to control some aspects of this variability. The first series concerns the classification of spectral targets obtained from a robust implementation of temporal decomposition. An MLFFA accepts three successive targets to output an allophonic label. No improvement has been found so far from traditional classification techniques (i.e. k -nearest neighbors). In a second series of experiments spectral transformations using MLFFA are introduced for the adaptation to new speakers. Compared to linear techniques (multivariate regression and canonical correlation analysis), the MLFFA approach offers some improvement
Keywords :
speech recognition; allophonic label; automatic speech recognition; interspeaker variability; intraspeaker variability; multilayer feed-forward automata; spectral targets; spectral transformations; temporal decomposition; Automata; Automatic control; Automatic speech recognition; Feedforward systems; Frequency; Interpolation; Loudspeakers; Multivariate regression; Robustness; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
DOI :
10.1109/ICASSP.1989.266452