DocumentCode
284577
Title
Connectionist word-level classification in speech recognition
Author
Haffner, Patrick
Author_Institution
CNET, Lannion, France
Volume
1
fYear
1992
fDate
23-26 Mar 1992
Firstpage
621
Abstract
MS-TDNN (multistate time delay neural networks), a connectionist architecture with embedded time alignment, was proposed recently. It makes word level classification possible and efficient on speech recognition tasks. Connectionist classification at the word level (rather than the usual maximum likelihood estimation) has not been commonly used in speech recognition, and raises issues related to proper temporal modeling and global discriminant training applied at the word level. The author shows how MS-TDNNs deal with these issues in a simple and efficient way, and achieve state of the art performance on several tasks representative of different problems in speaker-independent speech recognition: telephone digits and connected spelled letters
Keywords
delays; learning (artificial intelligence); neural nets; speech recognition; MS-TDNN; connected spelled letters; connectionist word-level classification; embedded time alignment; global discriminant training; multistate time delay neural networks; speech recognition; telephone digits; temporal modeling; training; Delay effects; Hidden Markov models; Intelligent networks; Maximum likelihood estimation; Neural networks; Power system modeling; Power system reliability; Speech recognition; Telephony; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location
San Francisco, CA
ISSN
1520-6149
Print_ISBN
0-7803-0532-9
Type
conf
DOI
10.1109/ICASSP.1992.225832
Filename
225832
Link To Document