• DocumentCode
    1784033
  • Title

    Research on Topic Detection Strategy Based on Extension of Comments and HowNet Lexeme

  • Author

    Yun Liu ; Xiaoxian Li ; Bing Zhao

  • Author_Institution
    Dept. of Electron. & Inf. Eng., Beijing Jiaotong Univ., Beijing, China
  • fYear
    2014
  • fDate
    27-29 Aug. 2014
  • Firstpage
    920
  • Lastpage
    923
  • Abstract
    As a product of Web2.0, micro-blog is developing rapidly these years. More and more information spread on the micro-blog because of its high speed and convenience, social hotspots and news events included. As a result, discovering, extraction and analyzing information become researching hotspots. By studying micro-blog text and long text cluster, this article draws a conclusion that traditional cluster algorithms cannot be used to discover topics because of the length of text. Therefore, this article proposes a solution which is based on the extension of the comments and HowNet lexeme. By this method, the short text and diversified expression can be overcome. Finally, the simulation results show that the proposed algorithm would significantly diminish the bad effects which are the results of short-text and improve the accuracy of clustering results.
  • Keywords
    Internet; pattern clustering; social networking (online); text analysis; HowNet Lexeme; Web 2.0; comment extension; microblog text cluster; short-text cluster; social hotspots; topic detection strategy; Accuracy; Clustering algorithms; Data mining; Internet; Probability; Semantics; Vectors; Microblogging short text; clustering algorithm; hot topics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 2014 Tenth International Conference on
  • Conference_Location
    Kitakyushu
  • Print_ISBN
    978-1-4799-5389-9
  • Type

    conf

  • DOI
    10.1109/IIH-MSP.2014.231
  • Filename
    6998477