• DocumentCode
    2159500
  • Title
    An efficient compressor for biological sequences
  • Author
    Gupta, Arpan ; Dubey, K.K.
  • Author_Institution
    Dept. of CS & IT, MJP Rohilkhand Univ., Bareilly, India
  • fYear
    2013
  • fDate
    22-23 Feb. 2013
  • Firstpage
    690
  • Lastpage
    695
  • Abstract
    This paper introduces a state-of-the-art compressor for DNA sequences that uses a replacement method. The replacement method introduces words, and a word-based compression scheme is used for encoding. The encoder assigns codes to words according to their frequency distribution. The resulting statistical compression algorithm is efficient and effective for DNA sequence compression. Experiments show that our algorithm outperforms existing compressors on typical DNA sequence datasets.
  • Keywords
    DNA; bioinformatics; data compression; encoding; statistical analysis; DNA sequence compression; DNA sequence datasets; biological sequence compressor; encoding; frequency distribution; replacement method; state of art compressor; statistical compression algorithm; word based compression scheme; Biological information theory; Compression algorithms; Context; DNA; Dictionaries; Encoding; Vocabulary; DNA compression; DNA sequences; Word based tagged code;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advance Computing Conference (IACC), 2013 IEEE 3rd International
  • Conference_Location
    Ghaziabad
  • Print_ISBN
    978-1-4673-4527-9
  • Type
    conf
  • DOI
    10.1109/IAdCC.2013.6514310
  • Filename
    6514310
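
The abstract describes a word-based scheme in which codes are assigned to words according to their frequency. The record does not give the paper's actual replacement method or code format, so the following is only a minimal sketch of the general idea under stated assumptions: words are taken to be fixed-length, non-overlapping k-letter chunks, and each word's code is simply its frequency rank (0 for the most frequent). The function names and the parameter k are hypothetical and do not come from the paper.

```python
from collections import Counter


def split_into_words(seq, k=4):
    """Cut a sequence into non-overlapping k-letter words.

    Assumption: fixed-length words stand in for the paper's
    replacement method, which this record does not describe.
    The last word may be shorter than k.
    """
    return [seq[i:i + k] for i in range(0, len(seq), k)]


def assign_codes(words):
    """Map each distinct word to its frequency rank.

    The most frequent word gets code 0, so a later variable-length
    integer encoding would spend the fewest bits on common words.
    Ties are broken alphabetically to keep the codebook deterministic.
    """
    counts = Counter(words)
    ranked = sorted(counts, key=lambda w: (-counts[w], w))
    return {word: rank for rank, word in enumerate(ranked)}


def encode(seq, k=4):
    """Return the list of word codes and the codebook used."""
    words = split_into_words(seq, k)
    codebook = assign_codes(words)
    return [codebook[w] for w in words], codebook


def decode(codes, codebook):
    """Invert the codebook and rebuild the original sequence."""
    inverse = {code: word for word, code in codebook.items()}
    return "".join(inverse[c] for c in codes)


if __name__ == "__main__":
    sequence = "ACGTACGTTTGAACGT"
    codes, codebook = encode(sequence)
    print(codes, codebook)
    assert decode(codes, codebook) == sequence
```

A practical compressor would additionally serialize the codebook and pack the ranks with a variable-length code; both steps are omitted here because the record gives no details on them.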