DocumentCode
730741
Title
Improvements on transducing syllable lattice to word lattice for keyword search
Author
Hang Su ; Van Tung Pham ; Yanzhang He ; Hieronymus, James
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
fYear
2015
fDate
19-24 April 2015
Firstpage
4729
Lastpage
4733
Abstract
This paper investigates a weighted finite state transducer (WFST) based syllable decoding and transduction method for keyword search (KWS), and compares it with sub-word search and phone confusion methods in detail. Acoustic context dependent phone models are trained from word forced alignments and then used for syllable decoding and lattice generation. Out-of-vocabulary (OOV) keyword pronunciations are produced using a grapheme-to-syllable (G2S) system and then used to construct a lexical transducer. The lexical transducer is then composed with a keyword-boosted language model (LM) to transduce the syllable lattices to word lattices for final KWS. Word Error Rates (WER) and KWS results are reported for 5 different languages. It is shown that the syllable transduction method gives comparable KWS results to the syllable search and phone confusion methods. Combination of these three methods further improves OOV KWS performance.
Keywords
lattice theory; speech recognition; KWS; WFST; grapheme-to-syllable system; keyword search; keyword-boosted language model; multilingual speech; out-of-vocabulary keyword pronunciations; speech recognition; syllable decoding; syllable lattice; transduction method; weighted finite state transducer; word error rates; word lattice; Acoustics; Decoding; Keyword search; Lattices; Speech recognition; Training; Transducers; Keyword Search; OOV Keywords; Speech Recognition; Syllable Transduction; WFST;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178868
Filename
7178868
Link To Document