• DocumentCode
    719159
  • Title

    Principal component based method for whole genome phylogenetic analysis without alignment: Application to HEV genotype

  • Author

    Sahana, Subrata ; Das, Sanjoy ; Sarkar, Bimal Kumar

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Galgotias Univ., Greater Noida, India
  • fYear
    2015
  • fDate
    15-16 May 2015
  • Firstpage
    984
  • Lastpage
    989
  • Abstract
    We describe principal component method for the DNA sequence analysis using digital filters. With the huge amount of data accessible in the public domain, digital filters are very helpful in DNA sequence processing. In this technique, the occurrence frequency of the q-gram genetic word of interest is determined from the DNA sequence. The sequence is then elucidated by using finite impulse response (FIR) type filter in order to determine the q-gram word density along the sequence. The word density distribution is further used for principal component analysis (PCA) to determine the similarity / dissimilarity between the sequences. The technique is verified by using 48 HEV genotypes. The results are in good agreement with other methodology.
  • Keywords
    DNA; FIR filters; bioinformatics; digital filters; genetics; principal component analysis; DNA sequence analysis; DNA sequence processing; FIR type filter; HEV genotype; PCA; data accessibility; digital filters; finite impulse response type filter; occurrence frequency; principal component based method; public domain; q-gram genetic word; q-gram word density; sequence dissimilarity; sequence similarity; whole-genome phylogenetic analysis; Bioinformatics; DNA; Genomics; Hybrid electric vehicles; Phylogeny; Principal component analysis; Strain; DNA sequence; HEV; digital filter; principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing, Communication & Automation (ICCCA), 2015 International Conference on
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-8889-1
  • Type

    conf

  • DOI
    10.1109/CCAA.2015.7148518
  • Filename
    7148518