DocumentCode
130978
Title
Finding dimensions for text based on heterogeneous information network
Author
Fei Jiang ; Xiaoguang Hong ; Zhaohui Peng ; Qingzhong Li
Author_Institution
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
fYear
2014
fDate
27-29 June 2014
Firstpage
819
Lastpage
823
Abstract
We propose an approach applicable in the problem of multi dimensions text mining that finds out several sets of phrases which were referred to as the text dimension. Based on the dimensions of text found by the proposed approach, a network could be built by similarities between documents. A method is proposed to transform the network from a coarse-grained one to a fine-grained one. By repeatedly mining phrases sets from the networks of different granularities, we could get a refined text dimensions set. We provide experimental results on text mining showing the computational feasibility and effectiveness for finding text dimensions which combines text mining with network mining and can be used for learning interesting knowledge.
Keywords
data mining; information networks; text analysis; heterogeneous information network; multidimensions text mining; text dimension; Clustering algorithms; Communities; Data mining; Databases; Feature extraction; Image edge detection; Partitioning algorithms; heterogeneous information network; information network analysis method; network mining; text dimension;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
Conference_Location
Beijing
ISSN
2327-0586
Print_ISBN
978-1-4799-3278-8
Type
conf
DOI
10.1109/ICSESS.2014.6933692
Filename
6933692
Link To Document