DocumentCode :
1897911
Title :
Integrating time alignment and neural networks for high performance continuous speech recognition
Author :
Haffner, Patrick ; Franzini, Michael ; Waibel, Alex
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
1991
fDate :
14-17 Apr 1991
Firstpage :
105
Abstract :
The authors describe two systems in which neural network classifiers are merged with dynamic programming (DP) time alignment methods to produce high-performance continuous speech recognizers. One system uses the connectionist Viterbi-training (CVT) procedure, in which a neural network with frame-level outputs is trained using guidance from a time alignment procedure. The other system uses multi-state time-delay neural networks (MS-TDNNs), in which embedded DP time alignment allows network training with only word-level external supervision. The CVT results on the, TI Digits are 99.1% word accuracy and 98.0% string accuracy. The MS-TDNNs are described in detail, with attention focused on their architecture, the training procedure, and results of applying the MS-TDNNs to continuous speaker-dependent alphabet recognition: on two speakers, word accuracy is respectively 97.5% and 89.7%
Keywords :
delays; dynamic programming; neural nets; speech recognition; TI Digits; architecture; connectionist Viterbi-training; continuous speech recognition; dynamic programming; frame-level outputs; multistate time delay neural networks; network training; neural network classifiers; speaker-dependent alphabet recognition; string accuracy; time alignment methods; word accuracy; word-level external supervision; Application software; Computer science; Delay effects; Dynamic programming; Hidden Markov models; Neural networks; Pattern classification; Speech recognition; US Department of Defense; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
ISSN :
1520-6149
Print_ISBN :
0-7803-0003-3
Type :
conf
DOI :
10.1109/ICASSP.1991.150289
Filename :
150289
Link To Document :
بازگشت