DocumentCode :
173997
Title :
Transliteration Based Bengali Text Compression using Huffman principle
Author :
Hossain, M. Mofazzal ; Habib, Ahsan ; Rahman, Md Saifur
Author_Institution :
Comput. Sci. & Eng., Shahjalal Univ. of Sci. & Technol., Sylhet, Bangladesh
fYear :
2014
fDate :
23-24 May 2014
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, we propose a new technique to compress more symbolic language like Bengali through less symbolic language like English using Huffman principle. First we transliterate the text of more symbolic language to less symbolic language, and then we apply Huffman principle on the transliterated text. We have also shown that our transliteration based proposed method outperform the existing basic Huffman technique for every piece of Bengali text and significant compression ratio can be achieved.
Keywords :
data compression; natural language processing; text analysis; Huffman principle; symbolic language; transliteration based Bengali text compression; Computer science; Conferences; Data compression; Encoding; Floors; Informatics; Vegetation; ASCII code; Avro; Bengali text; Data compression; Huffman principle; Transliteration; UNICODE;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Informatics, Electronics & Vision (ICIEV), 2014 International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4799-5179-6
Type :
conf
DOI :
10.1109/ICIEV.2014.6850745
Filename :
6850745
Link To Document :
بازگشت