• DocumentCode
    137150
  • Title
    New encoding schemes for efficient multilingual text messaging
  • Author
    Jalan, Ankit; Rajawat, Ketan; Hegde, Rajesh M.
  • Author_Institution
    Indian Inst. of Technol. Kanpur, Kanpur, India
  • fYear
    2014
  • fDate
    Feb. 28 2014-March 2 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Short messaging service (SMS) on cell phones is a very popular mode of interaction. However, current encoding schemes allow the transmission of only 160 characters per message in English, and only 70 characters in Indian languages such as Hindi because of the Unicode format used for them. In this paper, a novel encoding scheme is proposed, along with several modifications to standard schemes that make them efficient for transmitting Hindi and multilingual text. The efficiency of the proposed schemes is evaluated by conducting experiments on a multilingual database specially collected from Twitter, using dictionary learning. Performance evaluation shows that these encoding schemes allow nearly 160 characters per SMS for messages in both pure Hindi and multilingual text.
  • Keywords
    electronic messaging; encoding; natural language processing; text analysis; SMS; UNICODE format; dictionary learning; encoding schemes; multilingual database; multilingual text messaging; short messaging service; Algorithm design and analysis; Databases; Decoding; Dictionaries; Encoding; Standards; Switches
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications (NCC), 2014 Twentieth National Conference on
  • Conference_Location
    Kanpur
  • Type
    conf
  • DOI
    10.1109/NCC.2014.6811326
  • Filename
    6811326
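
The 160-versus-70 character limits quoted in the abstract follow from simple arithmetic, sketched below. The abstract does not state the underlying figures, so the 140-octet single-message payload and the 7-bit (GSM default alphabet) and 16-bit (UCS-2) character widths are assumptions taken from the standard SMS specifications, and the function names are hypothetical. This is not the encoding scheme proposed in the paper.

```python
# Assumption: a single SMS carries 140 octets of user data (standard SMS, not from the abstract).
SMS_PAYLOAD_OCTETS = 140


def chars_per_sms(bits_per_char: int) -> int:
    """Number of fixed-width characters that fit in one SMS payload."""
    return (SMS_PAYLOAD_OCTETS * 8) // bits_per_char


def ucs2_octets(text: str) -> int:
    """Octets needed to send `text` as UCS-2.

    For text in the Basic Multilingual Plane (which includes Devanagari),
    UTF-16-BE is byte-identical to UCS-2, so it is used here as a stand-in.
    """
    return len(text.encode("utf-16-be"))


if __name__ == "__main__":
    print(chars_per_sms(7))    # 7-bit GSM default alphabet -> 160
    print(chars_per_sms(16))   # 16-bit UCS-2 (used for Hindi) -> 70
    print(ucs2_octets("नमस्ते"))  # every Devanagari code point costs 2 octets
```

Under these assumptions, any scheme that brings Hindi close to 160 characters per message has to spend roughly 7 bits per character on average instead of 16.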