DocumentCode :
137150
Title :
New encoding schemes for efficient multilingual text messaging
Author :
Jalan, Ankit ; Rajawat, Ketan ; Hegde, Rajesh M.
Author_Institution :
Indian Inst. of Technol. Kanpur, Kanpur, India
fYear :
2014
fDate :
Feb. 28 2014-March 2 2014
Firstpage :
1
Lastpage :
6
Abstract :
Short messaging service (SMS) on cell phones is a very popular mode of interaction. However, current encoding schemes allow the transmission of only 160 characters per message in English. In Indian languages such as Hindi, only 70 characters can be transmitted because of the Unicode format used for them. In this paper, a novel encoding scheme is proposed, along with several modifications to standard schemes that make them efficient for transmitting Hindi and multilingual text. The encoding schemes allow the transmission of around 160 characters for both pure Hindi and multilingual text. The efficiency of the proposed schemes is evaluated by conducting experiments on a multilingual database specially collected from Twitter using dictionary learning. Performance evaluation shows that these encoding schemes allow nearly 160 characters per SMS for messages in both Hindi and multilingual text.
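The 160 and 70 character limits quoted in the abstract follow from simple arithmetic, sketched below. The sketch assumes the standard 140-octet SMS user-data payload, a 7-bit default alphabet for English, and a 16-bit (UCS-2) encoding for Unicode text; these figures are general SMS background and are not stated in the abstract itself. The helper name `max_chars` is hypothetical and does not represent the encoding schemes proposed in the paper.

```python
# Assumption: a single SMS carries 140 octets of user data (not stated in the abstract).
SMS_PAYLOAD_BITS = 140 * 8  # 1120 bits


def max_chars(bits_per_char: int) -> int:
    """Return how many fixed-width characters fit in one SMS payload."""
    return SMS_PAYLOAD_BITS // bits_per_char


# Assumption: English text uses a 7-bit alphabet, Hindi uses 16-bit UCS-2.
english_limit = max_chars(7)
hindi_limit = max_chars(16)

print(f"7-bit alphabet: {english_limit} characters per SMS")
print(f"16-bit UCS-2:   {hindi_limit} characters per SMS")
```

Under these assumptions, any fixed-width code of 7 bits or fewer per character would reach the 160-character figure the abstract targets for Hindi.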
Keywords :
electronic messaging; encoding; natural language processing; text analysis; SMS; UNICODE format; dictionary learning; encoding schemes; multilingual database; multilingual text messaging; short messaging service; Algorithm design and analysis; Databases; Decoding; Dictionaries; Encoding; Standards; Switches;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Communications (NCC), 2014 Twentieth National Conference on
Conference_Location :
Kanpur
Type :
conf
DOI :
10.1109/NCC.2014.6811326
Filename :
6811326