• DocumentCode
    2859983
  • Title

    Combining Topic Models and Social Networks for Chat Data Mining

  • Author

    Tuulos, Ville H. ; Tirri, Henry

  • Author_Institution
    Helsinki Institute for Information Technology, Finland
  • fYear
    2004
  • fDate
    20-24 Sept. 2004
  • Firstpage
    206
  • Lastpage
    213
  • Abstract
    Informal chat-room conversations have intrinsically different properties from regular static document collections. Noise, concise expressions, and the dynamic, changing, and interleaved nature of discussions make chat data ill-suited for analysis with off-the-shelf text mining methods. On the other hand, interactive human communication has some implicit features which may be used to enhance the results. In our research we infer social network structures from the chat data by using a few basic heuristics. We then present some preliminary results showing that the inferred social graph may be used to enhance topic identification of a chat room when combined with state-of-the-art topic and classification models. For validation purposes we then compare the performance effects of using this social information in a topic classification task.
  • Keywords
    Data analysis; Data mining; Humans; Information analysis; Information technology; Interleaved codes; Internet; Social network services; Text mining; Web pages
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2100-2
  • Type

    conf

  • DOI
    10.1109/WI.2004.10025
  • Filename
    1410805