• DocumentCode
    151479
  • Title

    Detection and correction of non word spelling errors in Hindi language

  • Author

    Jain, Abhishek ; Jain, Manan

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ambedkar Inst. of Adv. Commun. Technol. & Res., New Delhi, India
  • fYear
    2014
  • fDate
    5-6 Sept. 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Hindi, the majority language of India, is still in its infancy with respect to natural language processing applications and research. Spelling error detection and correction for Hindi is an important task that has received inadequate attention. A large body of work exists on spelling detection and correction for English, but little has been done for Hindi. Unlike English, which has only two types of letters (21 consonants and 5 vowels), Hindi has, in addition to 40 consonants and 10 vowels, various other symbols (ten vowel signs (matras), half letters, the halant, etc.). The methods available for English therefore cannot be applied directly to Hindi. Many documents, newspapers, novels, books, magazines, government notices, etc. are typed in Hindi, so there is a need for spelling detection and correction tools for Hindi. This paper focuses on the detection and correction of non-word spelling errors in Hindi.
  • Keywords
    natural language processing; Hindi language; India; consonants; natural language processing applications; natural language processing research; nonword spelling error correction; nonword spelling error detection; vowels; Computer science; Computers; Context; Dictionaries; Error correction; Probability; Dictionary lookup; Error detection; Non word error; n-gram
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Mining and Intelligent Computing (ICDMIC), 2014 International Conference on
  • Conference_Location
    New Delhi
  • Print_ISBN
    978-1-4799-4675-4
  • Type

    conf

  • DOI
    10.1109/ICDMIC.2014.6954235
  • Filename
    6954235