DocumentCode
2018637
Title
Hybrid neural-network/HMM approaches to wordspotting
Author
Lippmann, Richard P. ; Singer, Elliot
Author_Institution
MIT Lincoln Lab., Lexington, MA, USA
Volume
1
fYear
1993
fDate
27-30 April 1993
Firstpage
565
Abstract
Two approaches to integrating neural network and hidden Markov model (HMM) algorithms into one hybrid wordspotter are being explored. One approach uses neural network secondary testing to analyze putative hits produced by a high-performance HMM wordspotter. This has provided consistent but small reductions in the number of false alarms required to obtain a given detection rate. In one set of experiments using the NIST Road Rally database, secondary testing reduced the false alarm rate by an average of 16.4%. A second approach uses radial basis function (RBF) neural networks to produce local machine scores for a Viterbi decoder. Network weights and RBF centers are trained at the word level to produce a high score for the correct keyword hits and a low score for false alarms generated by nonkeyword speech. Preliminary experiments using this approach are exploring a constructive approach which adds RBF centers to model nonkeyword near-misses and a cost function which attempts to maximize directly average detection accuracy over a specified range of false alarm rates.<>
Keywords
feedforward neural nets; hidden Markov models; learning (artificial intelligence); speech recognition; RBF centers; Viterbi decoder; cost function; detection accuracy; false alarms; hidden Markov model; neural network secondary testing; radial basis function; wordspotting;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location
Minneapolis, MN, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.1993.319181
Filename
319181
Link To Document