DocumentCode
3109525
Title
Semantic Chunk Annotation for questions using Maximum Entropy
Author
Fan, Shixi ; Zhang, Yaoyun ; Ng, Wing W Y ; Wang, Xuan ; Wang, Xiaolong
Author_Institution
Shenzhen Grad. Sch., Harbin Inst. of Technol., Shenzhen
fYear
2008
fDate
12-15 Oct. 2008
Firstpage
450
Lastpage
454
Abstract
We present a Maximum Entropy (ME) model for Semantic Chunk Annotation in a Chinese Question and Answer (Q&A) system. The model is derived from a corpus of real-world questions collected from Internet discussion groups. Because the questions were written to be answered by other people, they are very complex. We introduce the semantic chunks, describe the features used by the model, and adopt mutual information (MI) for feature selection. The training data consists of 14,000 sentences and the test data consists of 4,000 sentences. The model achieves an F-score of 90.68%.
Keywords
maximum entropy methods; query processing; search engines; semantic Web; Chinese Question and Answer system; Internet; Semantic Chunk Annotation; feature selection; maximum entropy model; mutual information; Computer architecture; Computer science; Databases; Entropy; Information retrieval; Natural languages; Testing; Maximum Entropy; Q&A
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location
Singapore
ISSN
1062-922X
Print_ISBN
978-1-4244-2383-5
Electronic_ISBN
1062-922X
Type
conf
DOI
10.1109/ICSMC.2008.4811317
Filename
4811317