DocumentCode :
284577
Title :
Connectionist word-level classification in speech recognition
Author :
Haffner, Patrick
Author_Institution :
CNET, Lannion, France
Volume :
1
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
621
Abstract :
MS-TDNN (multistate time delay neural networks), a connectionist architecture with embedded time alignment, was proposed recently. It makes word level classification possible and efficient on speech recognition tasks. Connectionist classification at the word level (rather than the usual maximum likelihood estimation) has not been commonly used in speech recognition, and raises issues related to proper temporal modeling and global discriminant training applied at the word level. The author shows how MS-TDNNs deal with these issues in a simple and efficient way, and achieve state of the art performance on several tasks representative of different problems in speaker-independent speech recognition: telephone digits and connected spelled letters
Keywords :
delays; learning (artificial intelligence); neural nets; speech recognition; MS-TDNN; connected spelled letters; connectionist word-level classification; embedded time alignment; global discriminant training; multistate time delay neural networks; speech recognition; telephone digits; temporal modeling; training; Delay effects; Hidden Markov models; Intelligent networks; Maximum likelihood estimation; Neural networks; Power system modeling; Power system reliability; Speech recognition; Telephony; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.225832
Filename :
225832
Link To Document :
بازگشت