DocumentCode :
2565122
Title :
Extracting Chinese question-answer pairs from online forums
Author :
Wang, Baoxun ; Liu, Bingquan ; Sun, Chengjie ; Wang, Xiaolong ; Sun, Lin
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
fYear :
2009
fDate :
11-14 Oct. 2009
Firstpage :
1159
Lastpage :
1164
Abstract :
Extracting question-answer pairs from online forums is worthwhile because forums contain a huge amount of valuable user-generated resources. In this paper, we consider the problem of extracting Chinese question-answer pairs for the first time. We present a strategy for detecting Chinese questions and their answers: a sequential-rule-based method finds questions in a forum thread, and non-textual features based on forum structure then improve the performance of answer detection in the same thread. Experimental results show that our techniques are very effective.
Keywords :
Internet; data mining; information retrieval; knowledge based systems; Chinese question-answer pairs extraction; nontextual feature; online forum thread; sequential rule based method; user generated resource; Computer science; Cybernetics; Data mining; Feature extraction; Humans; Natural languages; Sun; Testing; USA Councils; Yarn; classification; information extraction; labeled sequential rules; nontextual features; question answering;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location :
San Antonio, TX
ISSN :
1062-922X
Print_ISBN :
978-1-4244-2793-2
Electronic_ISBN :
1062-922X
Type :
conf
DOI :
10.1109/ICSMC.2009.5345956
Filename :
5345956