DocumentCode
2859983
Title
Combining Topic Models and Social Networks for Chat Data Mining
Author
Tuulos, Ville H. ; Tirri, Henry
Author_Institution
Helsinki Institute for Information Technology, Finland
fYear
2004
fDate
20-24 Sept. 2004
Firstpage
206
Lastpage
213
Abstract
Informal chat-room conversations have intrinsically different properties from regular static document collections. Noise, concise expressions and dynamic, changing and interleaving nature of discussions make chat data ill-suited for analysis with an off-the-shelf text mining method. On the other hand, interactive human communication has some implicit features which may be used to enhance the results. In our research we infer social network structures from the chat data by using a few basic heuristics. We then present some preliminary results showing that the inferred social graph may be used to enhance topic identification of a chat room when combined with a state-of-the-art topic and classification models. For validation purposes we then compare the performance effects of using this social information in a topic classification task.
Keywords
Data analysis; Data mining; Humans; Information analysis; Information technology; Interleaved codes; Internet; Social network services; Text mining; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2100-2
Type
conf
DOI
10.1109/WI.2004.10025
Filename
1410805
Link To Document