Title :
Classification of CpG Islands in the Human Genome Based on the Interval Distance Distribution of Adjacent CG Sites
Author :
Qi, Changle ; Wu, Xiaoming ; Liu, Lili ; Du, Jianqiang ; Wang, Bo
Author_Institution :
Key Lab. of Biomed. Inf. Eng. of Minist. of Educ., X´´ian Jiaotong Univ., X´´ian, China
fDate :
March 31 2009-April 2 2009
Abstract :
There have been many studies analyzing relations between CpG islands and gene functions. Most results showed that promoters of many housekeeping genes contain CpG islands, however, the relation between gene functions and CG dinucleotides positions in CpG islands was less considered. In this study, we try to classify CpG islands according to interval distance distribution of adjacent CG sites and find some functional correlations. First the human genome sequences were downloaded from the EMBL Nucleotide Sequence Database. Then a dataset was constructed, each record of which is an interval distance distribution of adjacent CG sites of a CpG island. Finally an algorithm was designed, which can calculate approximately minimal difference of any two records. Based on the algorithm, we obtained many classes using the hierarchical clustering method, each of which contains some similar CpG islands, and some of their common features were studied.
Keywords :
biology computing; genetics; CG sites; CpG islands; gene functions; genomic regions; hierarchical clustering method; human genome; human genome sequences; interval distance distribution; Bioinformatics; Biomedical engineering; Character generation; Clustering algorithms; Computer science; DNA; Databases; Genomics; Humans; Sequences;
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
DOI :
10.1109/CSIE.2009.822