DocumentCode
137150
Title
New encoding schemes for efficient multilingual text messaging
Author
Jalan, Ankit ; Rajawat, Ketan ; Hegde, Rajesh M.
Author_Institution
Indian Inst. of Technol. Kanpur, Kanpur, India
fYear
2014
fDate
Feb. 28 2014-March 2 2014
Firstpage
1
Lastpage
6
Abstract
Short messaging service (SMS) using cell phones is a very popular mode of interaction. However, current encoding schemes allow the transmission of only 160 characters per message in English. Only 70 characters can be transmitted in Indian languages such as Hindi, because of the UNICODE format used for them. In this paper, a novel encoding scheme is proposed, along with several modifications to standard schemes that make them efficient for transmitting Hindi and multilingual text. The efficiency of the proposed schemes is evaluated by conducting experiments on a multilingual database specially collected from Twitter using dictionary learning. Performance evaluation shows that these encoding schemes allow nearly 160 characters per SMS for both pure Hindi and multilingual text.
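The 160 and 70 character limits quoted above follow from the fixed size of a single SMS payload. The following is a minimal sketch of that arithmetic, assuming the standard 140-octet payload, a 7-bit default alphabet for English, and a 16-bit UCS-2 encoding for Hindi. These details are not stated in the abstract, and the names `SMS_PAYLOAD_BITS` and `max_chars` are illustrative only; the sketch does not represent the encoding schemes proposed in the paper.

```python
# Assumption: one SMS carries 140 octets of user data (not stated in the abstract).
SMS_PAYLOAD_BITS = 140 * 8  # 1120 bits


def max_chars(bits_per_char: int) -> int:
    """Return how many fixed-width characters fit in one SMS payload."""
    return SMS_PAYLOAD_BITS // bits_per_char


# Assumed encodings: 7 bits per character for the default alphabet,
# 16 bits per character for UCS-2 (used for scripts such as Devanagari).
english_limit = max_chars(7)
hindi_limit = max_chars(16)

print(english_limit, hindi_limit)
```

Under these assumptions, any scheme that brings Hindi text close to 160 characters per message must average roughly 7 bits per character.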
Keywords
electronic messaging; encoding; natural language processing; text analysis; SMS; UNICODE format; dictionary learning; encoding schemes; multilingual database; multilingual text messaging; short messaging service; Algorithm design and analysis; Databases; Decoding; Dictionaries; Encoding; Standards; Switches;
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications (NCC), 2014 Twentieth National Conference on
Conference_Location
Kanpur
Type
conf
DOI
10.1109/NCC.2014.6811326
Filename
6811326