DocumentCode :
2477147
Title :
Is the reduction of dimensionality to a small number of features always necessary in constructing predictive models for analysis of complex diseases or behaviours?
Author :
Zollanvari, Amin ; Saccone, Nancy L. ; Bierut, Laura J. ; Ramoni, Marco F. ; Alterovitz, Gil
Author_Institution :
Med. Sch., Harvard-MIT Div. of Health Sci. & Technol., Harvard Univ., Boston, MA, USA
fYear :
2011
fDate :
Aug. 30 2011-Sept. 3 2011
Firstpage :
3573
Lastpage :
3576
Abstract :
Gene expression and genome wide association data have provided researchers the opportunity to study many complex traits and diseases. When designing prognostic and predictive models capable of phenotypic classification in this area, significant reduction of dimensionality through stringent filtering and/or feature selection is often deemed imperative. Here, this work challenges this presumption through both theoretical and empirical analysis. This work demonstrates that by a proper compromise between structure of the selected model and the number of features, one is able to achieve better performance even in large dimensionality. The inclusion of many genes/variants in the classification rules can help shed new light on the analysis of complex traitstraits that are typically determined by many causal variants with small effect size.
Keywords :
data reduction; diseases; genetics; genomics; complex behaviours; complex diseases; dimensionality reduction; feature selection; gene expression; genome wide association data; large dimensionality; phenotypic classification; predictive models; prognostic models; stringent filtering; Bioinformatics; Diseases; Error analysis; Gene expression; Genomics; Predictive models; Behavior; Disease; Humans; Models, Theoretical;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE
Conference_Location :
Boston, MA
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4121-1
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/IEMBS.2011.6090596
Filename :
6090596
Link To Document :
بازگشت