Title :
Challenges and techniques for mining real clinical data
Author_Institution :
Univ. of California, Los Angeles, CA
Abstract :
Regression analysis and statistical hypothesis testing are commonly used for association and classification of clinical data sets in medical studies. Although such traditional techniques are wildly used, they have several shortcomings. For example, when analyzing datasets with a large number of temporal attributes, domain experts often miss important associative attributes in regression analysis because of the large number of correlated attributes. On the other hand, for rare occurring diseases or operations, the number of documented observed cases is usually small, and hypothesis testing becomes ineffective for such analysis due to insufficient statistical significance. We shall present two such case studies to showcase how data mining techniques [1-7] can be used to remedy such shortcomings.
Keywords :
data mining; medical computing; regression analysis; statistical testing; clinical data sets; medical studies; real clinical data mining; regression analysis; statistical hypothesis testing; temporal attributes; Antidepressants; Bladder; Data mining; Diseases; Drugs; Pregnancy; Regression analysis; Statistical analysis; Surgery; Testing;
Conference_Titel :
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-2512-9
Electronic_ISBN :
978-1-4244-2513-6
DOI :
10.1109/GRC.2008.4664809