Title :
Cross-site combination and evaluation of subword spoken term detection systems
Author :
Mertens, Timo ; Wallace, Roy ; Schneider, Daniel
Author_Institution :
Dept. of Electron. & Telecommun., NTNU, Trondheim, Norway
Abstract :
The design and evaluation of subword-based spoken term detection (STD) systems depends on various factors, such as language, type of the speech to be searched and application scenario. The choice of the subword unit and search approach, however, is oftentimes made regardless of these factors. Therefore, we evaluate two subword STD systems across two data sets with varying properties to investigate the influence of different subword units on STD performance when working with different data types. Results show that on German broadcast news data, constrained search in syllable lattices is effective, whereas fuzzy phone lattice search is superior in more challenging English conversational telephone speech. By combining the key features of the two systems at an early stage, we achieve improvements in Figure of Merit of up to 13.4% absolute on the German data. We also show that the choice of the appropriate evaluation metric is crucial when comparing retrieval performances across systems.
Keywords :
fuzzy set theory; information retrieval; speech recognition; English conversational telephone speech; German broadcast news data; STD system; cross-site combination; fuzzy phone lattice search; subword spoken term detection system; syllable lattices; Accuracy; Decoding; Error analysis; Lattices; Measurement; Speech; Speech recognition;
Conference_Titel :
Content-Based Multimedia Indexing (CBMI), 2011 9th International Workshop on
Conference_Location :
Madrid
Print_ISBN :
978-1-61284-432-9
Electronic_ISBN :
1949-3983
DOI :
10.1109/CBMI.2011.5972521