• DocumentCode
    322418
  • Title

    Annotated statistical indices for sequence analysis

  • Author

    Apostolico, Alberto ; Bock, Mary Ellen ; Xu, Xuyan

  • Author_Institution
    Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
  • fYear
    1997
  • fDate
    11-13 Jun 1997
  • Firstpage
    215
  • Lastpage
    229
  • Abstract
    A statistical index for string x is a digital-search tree or trie that returns, for any query string ω and in a number of comparisons bounded by the length of ω, the number of occurrences of ω in x. Clever algorithms are available that support the construction and weighting of such indices in time and space linear in the length of x. This paper addresses the problem of annotating a statistical index with such parameters as the expected value and variance of the number of occurrences of each substring
  • Keywords
    pattern recognition; sequences; statistical analysis; tree searching; annotated statistical indices; construction; digital-search tree; occurrence; query string; sequence analysis; statistical index; substring; trie; variance; weighting; Algorithm design and analysis; Bioinformatics; Frequency measurement; Genomics; Pattern analysis; Pattern matching; Sequences; Statistical analysis; Statistics; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Compression and Complexity of Sequences 1997. Proceedings
  • Conference_Location
    Salerno
  • Print_ISBN
    0-8186-8132-2
  • Type

    conf

  • DOI
    10.1109/SEQUEN.1997.666917
  • Filename
    666917