• DocumentCode
    2083633
  • Title

    Pattern Recognition based DNA Sequence Compressor

  • Author

    Arokiaraj, S. Panneer ; Robert, L.

  • Author_Institution
    Comput. Sci., Periyar EVR Coll., Trichy, India
  • fYear
    2012
  • fDate
    18-20 Dec. 2012
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The genome of every living organism contains all of its hereditary information, encoded in DNA. Owing to the ever-increasing demand for DNA-based technological developments, a large number of DNA sequences have been identified and stored in genomic databases, and the sizes of these databases are expected to increase exponentially. Compression is therefore desirable to reduce both storage requirements and transmission time. To that end, this paper proposes the Pattern Recognition based DNA Sequence Compression (PRDNAC) algorithm, which compresses DNA sequences by identifying appropriate patterns, achieving a good compression ratio and a compression gain much better than that of many standard compressors, including RARLabs' WinRAR version 4.x.
  • Keywords
    biology computing; data compression; database management systems; genomics; molecular biophysics; pattern recognition; PRDNAC algorithm; WinRAR compressor; compression gain; compression ratio; genomic database; pattern recognition based DNA sequence compressor; bit pattern; DNA sequence compression; PRDNAC
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Intelligence & Computing Research (ICCIC), 2012 IEEE International Conference on
  • Conference_Location
    Coimbatore
  • Print_ISBN
    978-1-4673-1342-1
  • Type

    conf

  • DOI
    10.1109/ICCIC.2012.6510211
  • Filename
    6510211