• DocumentCode
    130978
  • Title

    Finding dimensions for text based on heterogeneous information network

  • Author

    Fei Jiang ; Xiaoguang Hong ; Zhaohui Peng ; Qingzhong Li

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
  • fYear
    2014
  • fDate
    27-29 June 2014
  • Firstpage
    819
  • Lastpage
    823
  • Abstract
    We propose an approach applicable in the problem of multi dimensions text mining that finds out several sets of phrases which were referred to as the text dimension. Based on the dimensions of text found by the proposed approach, a network could be built by similarities between documents. A method is proposed to transform the network from a coarse-grained one to a fine-grained one. By repeatedly mining phrases sets from the networks of different granularities, we could get a refined text dimensions set. We provide experimental results on text mining showing the computational feasibility and effectiveness for finding text dimensions which combines text mining with network mining and can be used for learning interesting knowledge.
  • Keywords
    data mining; information networks; text analysis; heterogeneous information network; multidimensions text mining; text dimension; Clustering algorithms; Communities; Data mining; Databases; Feature extraction; Image edge detection; Partitioning algorithms; heterogeneous information network; information network analysis method; network mining; text dimension;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
  • Conference_Location
    Beijing
  • ISSN
    2327-0586
  • Print_ISBN
    978-1-4799-3278-8
  • Type

    conf

  • DOI
    10.1109/ICSESS.2014.6933692
  • Filename
    6933692