DocumentCode :
2016948
Title :
High performance Chinese Spoken Term Detection based on term expansion
Author :
Li, Wei ; Wu, Ji ; Lv, Ping
Author_Institution :
Dept. Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
430
Lastpage :
434
Abstract :
This paper mainly focuses on improving the performance of Chinese Spoken Term Detection (S TD) systems using words as searching units. These systems are designed to find instances of particular phrases (called Terms) in voices. Terms are usually segmented into word sequences and searched with voices´ recognition results. Mismatches between recognition results and word-segmentation might affect their performance. To solve this problem, two algorithms are designed to expand the searching spaces. Th e exp anded algorithms improve systems´ performance while lead to a side-effect for its efficiency. To speed up the retrieval tasks, the Finite State Automation (FSA) is used. A token-passing algorithm is the n developed for fast search. Experiments have shown that the proposed term expansion method could effectively improve the STD system´s performance. And using FSA with token-passing algorithm to search could effectively improve searching efficiency.
Keywords :
finite state machines; natural language processing; search problems; speech recognition; word processing; expanded algorithm; finite state automation; high performance Chinese spoken term detection; searching efficiency; term expansion; token passing algorithm; voice recognition; word segmentation; word sequences; Algorithm design and analysis; Conferences; Minimization; Redundancy; Speech; Speech recognition; Text recognition; Chinese Spoken Term Detection; Finite State Automation; term expansion; token passing; word segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684852
Filename :
5684852
Link To Document :
بازگشت