• DocumentCode
    495502
  • Title

    Classification of CpG Islands in the Human Genome Based on the Interval Distance Distribution of Adjacent CG Sites

  • Author

    Qi, Changle ; Wu, Xiaoming ; Liu, Lili ; Du, Jianqiang ; Wang, Bo

  • Author_Institution
    Key Lab. of Biomed. Inf. Eng. of Minist. of Educ., X´´ian Jiaotong Univ., X´´ian, China
  • Volume
    4
  • fYear
    2009
  • fDate
    March 31 2009-April 2 2009
  • Firstpage
    246
  • Lastpage
    249
  • Abstract
    There have been many studies analyzing relations between CpG islands and gene functions. Most results showed that promoters of many housekeeping genes contain CpG islands, however, the relation between gene functions and CG dinucleotides positions in CpG islands was less considered. In this study, we try to classify CpG islands according to interval distance distribution of adjacent CG sites and find some functional correlations. First the human genome sequences were downloaded from the EMBL Nucleotide Sequence Database. Then a dataset was constructed, each record of which is an interval distance distribution of adjacent CG sites of a CpG island. Finally an algorithm was designed, which can calculate approximately minimal difference of any two records. Based on the algorithm, we obtained many classes using the hierarchical clustering method, each of which contains some similar CpG islands, and some of their common features were studied.
  • Keywords
    biology computing; genetics; CG sites; CpG islands; gene functions; genomic regions; hierarchical clustering method; human genome; human genome sequences; interval distance distribution; Bioinformatics; Biomedical engineering; Character generation; Clustering algorithms; Computer science; DNA; Databases; Genomics; Humans; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Engineering, 2009 WRI World Congress on
  • Conference_Location
    Los Angeles, CA
  • Print_ISBN
    978-0-7695-3507-4
  • Type

    conf

  • DOI
    10.1109/CSIE.2009.822
  • Filename
    5170996