Title :
Data Mining in census data with CART
Author :
Sheng, Bin ; Gengxin, Sun
Author_Institution :
Coll. of Inf. Sci. & Eng., Qingdao Univ., Qingdao, China
Abstract :
Census can provide the fundamental population data of the whole nation. The census data are rich with hidden information that can be used for the investigation of national conditions and national power. Data Mining aims at extract the implicit, previously unknown, and potentially useful knowledge from voluminous, non-complete, fuzzy, stochastic data. Using Data Mining in census data can make full use of these data to provide services for country´s social and economic development. Classification is one of the important Data Mining techniques. The Decision Trees of classification analysis can give high accuracy prediction results, and the output results are easy to understand. In this paper we use the Decision-Tree-Based classification model - CART to analyze the census data in Chengyang and Laixi, and classify the inhabitants, then evaluate the results, discuss the important significance for using Data Mining in census data.
Keywords :
data encapsulation; data mining; decision trees; demography; pattern classification; socio-economic effects; CART; Chengyang; Laixi; census data; classification analysis; classification and regression trees; data mining; decision trees; fuzzy data; hidden information; national conditions; national power; population data; social and economic development; stochastic data; Economics; USA Councils; CART; Census; Classification; Data Mining; Decision Tree;
Conference_Titel :
Advanced Computer Theory and Engineering (ICACTE), 2010 3rd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6539-2
DOI :
10.1109/ICACTE.2010.5579631