DocumentCode
848625
Title
Adapted One-versus-All Decision Trees for Data Stream Classification
Author
Hashemi, Sattar ; Yang, Ying ; Mirzamomen, Zahra ; Kangavari, Mohammadreza
Author_Institution
Monash Univ., Clayton, VIC
Volume
21
Issue
5
fYear
2009
fDate
5/1/2009 12:00:00 AM
Firstpage
624
Lastpage
637
Abstract
One versus all (OVA) decision trees learn k individual binary classifiers, each one to distinguish the instances of a single class from the instances of all other classes. Thus OVA is different from existing data stream classification schemes whose majority use multiclass classifiers, each one to discriminate among all the classes. This paper advocates some outstanding advantages of OVA for data stream classification. First, there is low error correlation and hence high diversity among OVA´s component classifiers, which leads to high classification accuracy. Second, OVA is adept at accommodating new class labels that often appear in data streams. However, there also remain many challenges to deploy traditional OVA for classifying data streams. First, as every instance is fed to all component classifiers, OVA is known as an inefficient model. Second, OVA´s classification accuracy is adversely affected by the imbalanced class distribution in data streams. This paper addresses those key challenges and consequently proposes a new OVA scheme that is adapted for data stream classification. Theoretical analysis and empirical evidence reveal that the adapted OVA can offer faster training, faster updating and higher classification accuracy than many existing popular data stream classification algorithms.
Keywords
data analysis; decision trees; learning (artificial intelligence); pattern classification; data stream classification; data training; error correlation; one versus all decision tree; Data mining; Machine learning;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2008.181
Filename
4609384
Link To Document