DocumentCode
3104034
Title
A Chinese Synonyms Reduced Algorithm Based on Sememe Tree
Author
Liguo, Duan ; Junjie, Chen ; Haifang, Li ; Aiping, Li
Author_Institution
Coll. of Comput. Sci. & Technol., Taiyuan Univ. of Technol., Taiyuan, China
fYear
2010
fDate
26-28 Sept. 2010
Firstpage
337
Lastpage
340
Abstract
Question Understanding of Chinese Question-Answering System generally includes steps such as: word segmentation, POS Tagging, keywords expansion, information retrieval etc. The extended keyword set usually has redundant messages and part of the words and phrases may be not relevant to the question. Consequently, information retrieval with the extended keywords set may bring about large numbers of noise information and enhance the difficulty of answer pick-up. This paper explores the use of distance between vocabularies in the sememe tree for reducing keywords set. It analyzes the detailed steps of question understanding and the improved algorithm. Empirical results support the theoretical findings. The algorithm proposed in the paper achieves substantial improvement by 23% on the average, and wipes off the vocabulary beside the mark. Furthermore, it will improve the accuracy rate of Question Understanding in the subsequent steps.
Keywords
information retrieval; natural language processing; vocabulary; Chinese question answering system; Chinese synonyms reduced algorithm; POS Tagging; information retrieval; keywords expansion; sememe tree; vocabulary; word segmentation; Accuracy; Dictionaries; Semantics; Syntactics; Tagging; Vocabulary; Chinese Question-Answering System; Question Understanding; reducing keywords set; sememe tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Aspects of Social Networks (CASoN), 2010 International Conference on
Conference_Location
Taiyuan
Print_ISBN
978-1-4244-8785-1
Type
conf
DOI
10.1109/CASoN.2010.82
Filename
5636724
Link To Document