DocumentCode
3850282
Title
Lattice Indexing for Spoken Term Detection
Author
Doğan Can;Murat Saraclar
Author_Institution
USC CS Dept., University of Southern California, Los Angeles
Volume
19
Issue
8
fYear
2011
Firstpage
2338
Lastpage
2347
Abstract
This paper considers the problem of constructing an efficient inverted index for the spoken term detection (STD) task. More specifically, we construct a deterministic weighted finite-state transducer storing soft-hits in the form of (utterance ID, start time, end time, posterior score) quadruplets. We propose a generalized factor transducer structure which retains the time information necessary for performing STD. The required information is embedded into the path weights of the factor transducer without disrupting the inherent optimality. We also describe how to index all substrings seen in a collection of raw automatic speech recognition lattices using the proposed structure. Our STD indexing/search implementation is built upon the OpenFst Library and is designed to scale well to large problems. Experiments on Turkish and English data sets corroborate our claims.
Keywords
"Automata","Transducers","Indexes","Lattices","Speech recognition","Speech","Strontium"
Journal_Title
IEEE Transactions on Audio, Speech, and Language Processing
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2011.2134087
Filename
5752829
Link To Document