Title :
Topic Model for Graph Mining
Author :
Junyu Xuan ; Jie Lu ; Guangquan Zhang ; Xiangfeng Luo
Author_Institution :
Centre for Quantum Comput. & Intell. Syst., Univ. of Technol., Sydney, Ultimo, NSW, Australia
Abstract :
Graph mining has been a popular research area because of its numerous application scenarios. Many unstructured and structured data can be represented as graphs, such as, documents, chemical molecular structures, and images. However, an issue in relation to current research on graphs is that they cannot adequately discover the topics hidden in graph-structured data which can be beneficial for both the unsupervised learning and supervised learning of the graphs. Although topic models have proved to be very successful in discovering latent topics, the standard topic models cannot be directly applied to graph-structured data due to the “bag-of-word” assumption. In this paper, an innovative graph topic model (GTM) is proposed to address this issue, which uses Bernoulli distributions to model the edges between nodes in a graph. It can, therefore, make the edges in a graph contribute to latent topic discovery and further improve the accuracy of the supervised and unsupervised learning of graphs. The experimental results on two different types of graph datasets show that the proposed GTM outperforms the latent Dirichlet allocation on classification by using the unveiled topics of these two models to represent graphs.
Keywords :
data mining; data structures; graph theory; pattern classification; text analysis; unsupervised learning; Bernoulli distribution; GTM; bag-of-word assumption; classification; edge modeling; graph dataset; graph mining; graph nodes; graph representation; graph supervised learning; graph-structured data; innovative graph topic model; latent Dirichlet allocation; latent topic discovery; unsupervised learning; Chemical elements; Chemicals; Data mining; Data models; Hidden Markov models; Inference algorithms; Vectors; Graph mining; latent Dirichlet allocation (LDA); topic model;
Journal_Title :
Cybernetics, IEEE Transactions on
DOI :
10.1109/TCYB.2014.2386282