Title :
Phonetic name matching for cross-lingual Spoken Sentence Retrieval
Author :
Ji, Heng ; Grishman, Ralph ; Wang, Wen
Author_Institution :
City Univ. of New York, New York, NY
Abstract :
Cross-lingual spoken sentence retrieval (CLSSR) remains a challenge, especially for queries including OOV words such as person names. This paper proposes a simple method of fuzzy matching between query names and phones of candidate audio segments. This approach has the advantage of avoiding some word decoding errors in automatic speech recognition (ASR). Experiments on Mandarin-English CLSSR show that phone-based searching and conventional translation-based searching are complementary. Adding phone matching achieved 26.29% improvement on F-measure over searching on state-of-the-art machine translation (MT) output and 8.83% over entity translation (ET) output.
Keywords :
fuzzy systems; language translation; natural language processing; pattern matching; query processing; speech processing; speech recognition; Mandarin-English CLSSR; OOV words; automatic speech recognition; candidate audio segments; cross-lingual spoken sentence retrieval; entity translation; fuzzy matching; machine translation; phonetic name matching; query names; word decoding errors; Automatic speech recognition; Broadcasting; Content based retrieval; Decoding; Error analysis; Information retrieval; Natural languages; Pipelines; Speech recognition; Text recognition; Information Retrieval; Speech Recognition;
Conference_Titel :
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location :
Goa
Print_ISBN :
978-1-4244-3471-8
Electronic_ISBN :
978-1-4244-3472-5
DOI :
10.1109/SLT.2008.4777895