• DocumentCode
    477455
  • Title

    Application of Immune Classifier Based on Increment of Diversity in the Model Species Genomes Identification

  • Author

    Wang, Lianhong ; Zhang, Jing ; Gong, Gufeng ; Peng, Minfang

  • Author_Institution
    Coll. of Electr. & Inf. Eng., Hunan Univ., Changsha
  • Volume
    1
  • fYear
    2008
  • fDate
    20-22 Oct. 2008
  • Firstpage
    30
  • Lastpage
    35
  • Abstract
    The model species genomes project has played a very important role in the research of human genomes. By comparing and identifying the genomes´ information in the different biological evolution stages with the model species genomes, it is favorable to understand deeply the genomes´ structure and function of the advanced species, especially the human being, and to reveal the life´s essential law. In the paper, the DNA sequences of three model species including C. elegans, S. cerevisiae and A.thaliana are divided into three types: intron, exon, intergenic DNA at first. The bases of each sequence are given number (1, 2, hellip, N) base by base from the beginning. According to three phases of the codons, we can extract three subsequences which numbers are respectively 3n+1, 3n+2 and 3n+3 (n=0, 1, 2, ...,N/3-1). Calculating respectively the probability of four bases in three subsequence we can attain 12 parameters which are regarded as the state parameters of the diversity sources. Each sequence of intron, exon and intergenic DNA is expressed by a 12-dimension eigenvector. All of these sequences´ 12-dimension samples are divided into the training sample set and the testing sample set. Then, according to the immune network theory, a immune classifier based on the diversity increment is constructed, which considers the reciprocal of the diversity increment as affinity function and makes the immune network evolve in the direction of identifying the antigens by submitting unceasingly the antigens one by one, which is a record of the training sample set. Finally, the classifier has high performance and its prediction accuracy is up to 85% by test.
  • Keywords
    DNA; bioinformatics; eigenvalues and eigenfunctions; genomics; pattern classification; 12-dimension eigenvector; A.thaliana; C. elegans; DNA sequences; S. cerevisiae; diversity increment; exon DNA; immune classifier; immune network theory; intergenic DNA; intron DNA; model species genomes identification; Bioinformatics; Biological system modeling; DNA; Evolution (biology); Genomics; Humans; Immune system; Probability; Sequences; Testing; Genomes Identification; Immune Classifier; Model Species; the Diversity Increment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computation Technology and Automation (ICICTA), 2008 International Conference on
  • Conference_Location
    Hunan
  • Print_ISBN
    978-0-7695-3357-5
  • Type

    conf

  • DOI
    10.1109/ICICTA.2008.126
  • Filename
    4659437