• DocumentCode
    2861484
  • Title

    Discovery of Prokaryotic Relationships through Latent Structure of Correlated Nucleotide Sequences

  • Author

    Muzinich, Natalya

  • Author_Institution
    Indiana University
  • fYear
    2005
  • fDate
    25-25 June 2005
  • Firstpage
    143
  • Lastpage
    143
  • Abstract
    This paper describes an application of statistical techniques that have yielded fruitful results in many fields including artificial intelligence and information retrieval to the problem of establishing relationships among organisms. A combination of these techniques constitutes a new method of comparing organisms based on their whole genomic sequences. The method represents genomes as sets of short overlapping nucleotide subsequences and employs latent structure modeling to capture correlations in the observed patterns of their distribution. Factor scores computed to measure the correlations serve as the input to a Ward’s hierarchical cluster analysis method, which produces a tree of their relationships. The runtime results indicate that this method allows for the fast and efficient comparison that scales well as the number of organisms increases.
  • Keywords
    Ward’s hierarchical cluster analysis; principal component analysis; singular value decomposition; whole genome sequence; Artificial intelligence; Bioinformatics; DNA; Genetics; Genomics; Information retrieval; Organisms; Phylogeny; Proteins; Sequences; Ward’s hierarchical cluster analysis; principal component analysis; singular value decomposition; whole genome sequence;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition - Workshops, 2005. CVPR Workshops. IEEE Computer Society Conference on
  • Conference_Location
    San Diego, CA, USA
  • ISSN
    1063-6919
  • Print_ISBN
    0-7695-2372-2
  • Type

    conf

  • DOI
    10.1109/CVPR.2005.443
  • Filename
    1565461