Title :
Automatically Grouping Questions in Yahoo! Answers
Author :
Miao, Yajie ; Zhao, Lili ; Li, Chunping ; Tang, Jie
Author_Institution :
Sch. of Software, Tsinghua Univ., Beijing, China
fDate :
Aug. 31 2010-Sept. 3 2010
Abstract :
In this paper, we define and study a novel problem which is referred to as Community Question Grouping (CQG). Online QA services such as Yahoo! Answers contain large archives of community questions which are posted by users. Community Question Grouping is primarily concerned with grouping a collection of community questions into predefined categories. We first investigate the effectiveness of two basic methods, i.e., K-means and PLSA, in solving this problem. Then, both methods are extended in different ways to include user information. The experimental results with real datasets show that incorporation of user information improves the basic methods significantly. In addition, performance comparison reveals that PLSA with regularization is the most effective solution to the CQG problem.
Keywords :
Internet; portals; CQG problem; PLSA; Yahoo! Answers; community question grouping; online QA services; Community Question Grouping; Topic Model; Yahoo! Answers;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on
Conference_Location :
Toronto, ON
Print_ISBN :
978-1-4244-8482-9
Electronic_ISBN :
978-0-7695-4191-4
DOI :
10.1109/WI-IAT.2010.157