Title :
Feature Selection for Classifying Data Stream Based on Maximum Entropy
Author :
Liu, Yao-zong ; Wang, Yong-li ; Wei, Wei ; Zhang, Hong
Author_Institution :
Sch. of Comput., Nanjing Univ. of Sci. & Technol., Nanjing, China
Abstract :
Feature select ion is an important problem in the fields of machine learning and pat tern recognition. Data stream data classification with high dimensional and sparse, and the dimension of the need for compression, feature selection methods suitable for data stream classification study of very value of this area is currently a lack of in-depth study. This paper summarizes the current data flow classification feature selection research, analysis of the characteristics of different methods. Based on the principle of maximum entropy, naive Bayes with the technology on the data stream tuple feature selection attributes, divided into two different subsets of the merits, so as to enhance the work of C4.5 classifier results, the experiment proved not only StreamMEFS classification of time-saving, but also to improve the quality of the classification.
Keywords :
Bayes methods; data handling; learning (artificial intelligence); maximum entropy methods; pattern classification; data stream classification; feature selection; machine learning; maximum entropy; naive Bayes principle; pattern recognition; Costs; Electronic mail; Entropy; Feature extraction; Machine learning;
Conference_Titel :
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4199-0
DOI :
10.1109/CCPR.2009.5344111