• DocumentCode
    1619522
  • Title

    Word based data compression schemes

  • Author

    Bar-Ness, Yeheskel ; Peckham, Christopher

  • Author_Institution
    Center for Commun. & Signal Process. Res., New Jersey Inst. of Technol., Newark, NJ, USA
  • fYear
    1989
  • Firstpage
    300
  • Abstract
    Documents, papers, and reports contain large amounts of redundancy. This redundancy can be minimized by data-compression techniques to save storage space or to increase transmission efficiency. Several data-compression algorithms that are character based have been proposed in the literature. In English test files, however, the natural units of repetition are words or phrases, rather than characters. Three different source models for word-based data compression are proposed: move to front, frequency to front, and alpha-numeric to front. Their principles and methods for encoding their gathered data context are presented. Results of compression ratios obtained are included and compared. Comparisons with the performances of the Lempel-Ziv algorithm and fourth-order arithmetic encoding are also made. Some ideas for further improving the performance already obtained are proposed
  • Keywords
    data compression; encoding; English test files; alpha-numeric to front; compression ratios; encoding; frequency to front; move to front; redundancy; source models; word-based data compression; Arithmetic; Data compression; Dictionaries; Frequency; Signal processing; Signal processing algorithms; Statistics; Technological innovation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 1989., IEEE International Symposium on
  • Conference_Location
    Portland, OR
  • Type

    conf

  • DOI
    10.1109/ISCAS.1989.100350
  • Filename
    100350