DocumentCode :
1950801
Title :
Compression by induction of hierarchical grammars
Author :
Nevill-manning, Craig G. ; Witten, Ian H. ; Maulsby, David L.
Author_Institution :
Dept. of Comput. Sci., Waikato Univ., Hamilton, New Zealand
fYear :
1994
fDate :
29-31 Mar 1994
Firstpage :
244
Lastpage :
253
Abstract :
The paper describes a technique that constructs models of symbol sequences in the form of small, human-readable, hierarchical grammars. The grammars are both semantically plausible and compact. The technique can induce structure from a variety of different kinds of sequence, and examples are given of models derived from English text, C source code and a sequence of terminal control codes. It explains the grammatical induction technique, demonstrates its application to three very different sequences, evaluates its compression performance, and concludes by briefly discussing its use as a method for knowledge acquisition
Keywords :
data compression; grammars; C source code; English text; adaptive compression methods; compression performance; grammatical induction technique; hierarchical grammars; knowledge acquisition; sequences; symbol sequences; terminal control codes; Arithmetic; Computer science; Context modeling; Data compression; Dictionaries; Frequency; Knowledge acquisition; Production; Statistical analysis; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference, 1994. DCC '94. Proceedings
Conference_Location :
Snowbird, UT
Print_ISBN :
0-8186-5637-9
Type :
conf
DOI :
10.1109/DCC.1994.305932
Filename :
305932
Link To Document :
بازگشت