Title :
Relational Analysis of CpG Islands Methylation and Gene Expression in Human Lymphomas Using Possibilistic C-Means Clustering and Modified Cluster Fuzzy Density
Author :
Sjahputera, Ozy ; Keller, James M. ; Davis, J. Wade ; Taylor, Kristen H. ; Rahmatpanah, Farahnaz ; Shi, Huidong ; Anderson, Derek T. ; Blisard, Samuel N. ; Luke, Robert H., III ; Popescu, Mihail ; Arthur, Gerald C. ; Caldwell, Charles W.
Author_Institution :
Dept. of Pathology, Missouri Univ. Sch. of Med., Columbia, MO
Abstract :
Heterogeneous genetic and epigenetic alterations are commonly found in human non-Hodgkin´s lymphomas (NHL). One such epigenetic alteration is aberrant methylation of gene promoter-related CpG islands, where hypermethylation frequently results in transcriptional inactivation of target genes, while a decrease or loss of promoter methylation (hypomethylation) is frequently associated with transcriptional activation. Discovering genes with these relationships in NHL or other types of cancers could lead to a better understanding of the pathobiology of these diseases. The simultaneous analysis of promoter methylation using differential methylation hybridization (DMH) and its associated gene expression using expressed CpG island sequence tag (ECIST) microarrays generates a large volume of methylation-expression relational data. To analyze this data, we propose a set of algorithms based on fuzzy sets theory, in particular possibilistic c-means (PCM) and cluster fuzzy density. For each gene, these algorithms calculate measures of confidence of various methylation-expression relationships in each NHL subclass. Thus, these tools can be used as a means of high volume data exploration to better guide biological confirmation using independent molecular biology methods
Keywords :
arrays; biochemistry; cancer; fuzzy set theory; genetics; medical computing; molecular biophysics; statistical analysis; CpG islands methylation; cancers; differential methylation hybridization; epigenetic alterations; expressed CpG island sequence tag microarrays; fuzzy sets theory; gene expression; genetic alterations; human lymphomas; human nonHodgkin lymphomas; hypermethylation; hypomethylation; independent molecular biology methods; modified cluster fuzzy density; pathobiology; possibilistic c-means clustering; relational analysis; transcription; Cancer; Clustering algorithms; Data analysis; Diseases; Fuzzy set theory; Gene expression; Genetics; Humans; Hybrid power systems; Sequences; Methylation; cluster density.; clustering; expression; fuzzy sets; microarray; Artificial Intelligence; Cluster Analysis; Computer Simulation; CpG Islands; DNA Methylation; Data Interpretation, Statistical; Fuzzy Logic; Gene Expression Profiling; Humans; Lymphoma, Non-Hodgkin; Models, Genetic; Models, Statistical; Neoplasm Proteins; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Statistics as Topic; Tumor Markers, Biological;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2007.070205