DocumentCode
173997
Title
Transliteration Based Bengali Text Compression using Huffman principle
Author
Hossain, M. Mofazzal ; Habib, Ahsan ; Rahman, Md Saifur
Author_Institution
Comput. Sci. & Eng., Shahjalal Univ. of Sci. & Technol., Sylhet, Bangladesh
fYear
2014
fDate
23-24 May 2014
Firstpage
1
Lastpage
6
Abstract
In this paper, we propose a new technique to compress more symbolic language like Bengali through less symbolic language like English using Huffman principle. First we transliterate the text of more symbolic language to less symbolic language, and then we apply Huffman principle on the transliterated text. We have also shown that our transliteration based proposed method outperform the existing basic Huffman technique for every piece of Bengali text and significant compression ratio can be achieved.
Keywords
data compression; natural language processing; text analysis; Huffman principle; symbolic language; transliteration based Bengali text compression; Computer science; Conferences; Data compression; Encoding; Floors; Informatics; Vegetation; ASCII code; Avro; Bengali text; Data compression; Huffman principle; Transliteration; UNICODE;
fLanguage
English
Publisher
ieee
Conference_Titel
Informatics, Electronics & Vision (ICIEV), 2014 International Conference on
Conference_Location
Dhaka
Print_ISBN
978-1-4799-5179-6
Type
conf
DOI
10.1109/ICIEV.2014.6850745
Filename
6850745
Link To Document