DocumentCode
610337
Title
Graph stream classification using labeled and unlabeled graphs
Author
Shirui Pan ; Xingquan Zhu ; Chengqi Zhang ; Yu, Philip S.
Author_Institution
Centre for Quantum Comput. & Intell. Syst., Univ. of Technol., Sydney, NSW, Australia
fYear
2013
fDate
8-12 April 2013
Firstpage
398
Lastpage
409
Abstract
Graph classification is becoming increasingly popular due to the rapidly rising applications involving data with structural dependency. The wide spread of the graph applications and the inherent complex relationships between graph objects have made the labels of the graph data expensive and/or difficult to obtain, especially for applications involving dynamic changing graph records. While labeled graphs are limited, the copious amounts of unlabeled graphs are often easy to obtain with trivial efforts. In this paper, we propose a framework to build a stream based graph classification model by combining both labeled and unlabeled graphs. Our method, called gSLU, employs an ensemble based framework to partition graph streams into a number of graph chunks each containing some labeled and unlabeled graphs. For each individual chunk, we propose a minimum-redundancy subgraph feature selection module to select a set of informative subgraph features to build a classifier. To tackle the concept drifting in graph streams, an instance level weighting mechanism is used to dynamically adjust the instance weight, through which the subgraph feature selection can emphasize on difficult graph samples. The classifiers built from different graph chunks form an ensemble for graph stream classification. Experiments on real-world graph streams demonstrate clear benefits of using minimum-redundancy subgraph features to build accurate classifiers. By employing instance level weighting, our graph ensemble model can effectively adapt to the concept drifting in the graph stream for classification.
Keywords
graph theory; pattern classification; redundancy; complex graph object relationship; dynamic changing graph record; ensemble based framework; gSLU; graph chunk; graph stream classification; informative subgraph feature; instance level weighting mechanism; labeled graph; partition graph stream; redundancy subgraph feature selection module; structural dependency; unlabeled graph; Accuracy; Communities; Correlation; Feature extraction; Learning systems; Redundancy; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location
Brisbane, QLD
ISSN
1063-6382
Print_ISBN
978-1-4673-4909-3
Electronic_ISBN
1063-6382
Type
conf
DOI
10.1109/ICDE.2013.6544842
Filename
6544842
Link To Document