Title :
Linear correlation analysis of numeric attributes for government data
Author :
Chen, Ying ; Gu, Guochang ; Lv, Tianyang ; Huang, Shaobin ; Ni, Jun
Author_Institution :
Harbin Eng. Univ., Harbin
Abstract :
To analyze the linear correlations of numeric attributes of government data, this paper proposes a method based on the clustering algorithm. A clustering method is adopted to prune outliers and the linear correlation analysis is performed for each cluster, instead for the whole dataset. In this way, the method can obtain multiple correlations between the same two attributes. The paper presents the experiment on the government social security data. Experimental results show that the proposed method is much better than the traditional regression analysis and association rule analysis.
Keywords :
data mining; government data processing; pattern clustering; public administration; regression analysis; association rule analysis; clustering algorithm; government social security data; linear correlation analysis; regression analysis; Algorithm design and analysis; Association rules; Clustering algorithms; Computer science; Data analysis; Data engineering; Databases; Government; Performance analysis; Regression analysis;
Conference_Titel :
Computer and Computational Sciences, 2007. IMSCCS 2007. Second International Multi-Symposiums on
Conference_Location :
Iowa City, IA
Print_ISBN :
978-0-7695-3039-0
DOI :
10.1109/IMSCCS.2007.91