DocumentCode
3416978
Title
Interactive query learning for isolated speech recognition
Author
Hwang, Jenq-Neng ; Li, Hang
Author_Institution
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
fYear
1992
fDate
31 Aug-2 Sep 1992
Firstpage
93
Lastpage
102
Abstract
The authors propose an interactive query learning approach to isolated speech recognition tasks. The approach starts with training multiple `one-net-one-class´ time delay neural networks (TDNNs) based on sequences of LPC vectors. After all TDNNs are trained, initiated from each available LPC training sequence for one specific TDNN (say, class k ), an improved network inversion algorithm with imposing constraint is used to generate a set of inverted LPC sequences corresponding to various output values of the corresponding TDNN. By carefully listening to synthesized speech based on the inverted LPC sequences, a conjugate pair of LPC sequences is selected from the whole set of LPC sequences; one corresponds to the acceptable speech of class k and the other corresponds to the unacceptable speech of class k . This conjugate LPC sequence pair constitutes some parts of the classification boundary associated with this class, and should be further used as the training date to refine the already trained classifier boundary. A 6% accuracy improvement was achieved when the proposed method was tested on speaker independent E-set data
Keywords
learning (artificial intelligence); neural nets; speech recognition; LPC vectors sequences; conjugate LPC sequence pair; interactive query learning; isolated speech recognition; network inversion algorithm; speaker independent E-set data; synthesized speech; time delay neural networks; training; Automatic testing; Hidden Markov models; Information processing; Laboratories; Linear predictive coding; Network synthesis; Neural networks; Speech recognition; Speech synthesis; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks for Signal Processing [1992] II., Proceedings of the 1992 IEEE-SP Workshop
Conference_Location
Helsingoer
Print_ISBN
0-7803-0557-4
Type
conf
DOI
10.1109/NNSP.1992.253704
Filename
253704
Link To Document