DocumentCode :
591902
Title :
Incorporating syllable duration into line-detection-based spoken term detection
Author :
Ohno, Tetsufumi ; Akiba, Tatsuro
Author_Institution :
Dept. of Comput. Sci. & Eng., Toyohashi Univ. of Technol., Toyohashi, Japan
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
204
Lastpage :
209
Abstract :
A conventional method for spoken term detection (STD) is to apply approximate string matching to subword sequences in a spoken document obtained by speech recognition. An STD method that considers string matching as line detection in a syllable distance plane has been proposed. While this has demonstrated fast ordered-by-distance detections, it has still suffered from the insertion and deletion errors introduced by the speech recognition. In this work, we aim to improve detection performance by employing syllable-duration information. The proposed method enables robust detection by introducing a distance plane that uses frames as units instead of using syllables as units. Our experimental evaluation showed that the incorporation of syllable-duration achieved higher detection performance in high-recall regions.
Keywords :
speech recognition; string matching; word processing; STD method; approximate string matching; deletion errors; high-recall regions; insertion errors; line detection-based spoken term detection; ordered-by-distance detection; speech recognition; spoken document; subword sequences; syllable distance plane; syllable duration information; Equations; Estimation error; Hidden Markov models; Indexing; Mathematical model; Speech recognition; Approximate String Matching; Spoken Term Detection; Syllable Duration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424223
Filename :
6424223
Link To Document :
بازگشت