Title :
Hypothesis generation and data quality assessment through association mining
Author :
Chen, Ping ; Garcia, Walter
Author_Institution :
Dept. of Comput. & Math Sci., Univ. of Houston, Houston, TX, USA
Abstract :
Association mining aims to find valid correlations among data attributes, and has been widely applied to many areas of data analysis. In this paper we present a semantic network based association analysis model including three spreading activation methods, and apply this model to assess the quality of a dataset, and generate semantically valid new hypotheses for further investigation. We evaluate our approach on a real public health dataset, the Heartfelt study, and the experiment shows promising results.
Keywords :
data analysis; data mining; medical administrative data processing; semantic networks; association analysis model; association mining; data analysis; data attributes; data quality assessment; dataset quality; hypothesis generation; public health dataset; semantic network; spreading activation methods; Analytical models; Association rules; Diseases; Knowledge engineering; Public healthcare; Semantics; Association rule mining; Data quality assessment; Hypothesis generation; Semantic network;
Conference_Titel :
Cognitive Informatics (ICCI), 2010 9th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8041-8
DOI :
10.1109/COGINF.2010.5599828