• DocumentCode
    234799
  • Title
    A system for identification of idioms in Hindi
  • Author
    Priyanka; Sinha, R.M.K.
  • Author_Institution
    Comput. Sci. & Eng., Noida, India
  • fYear
    2014
  • fDate
    7-9 Aug. 2014
  • Firstpage
    467
  • Lastpage
    472
  • Abstract
    Idioms are used extensively in everyday language. They carry a metaphorical sense that makes them difficult to comprehend, because their meaning cannot be deduced from the meanings of their constituent parts. They pose a challenge for natural language processing (NLP) applications such as machine translation, information retrieval and question answering, because their translation and meaning need to be derived logically rather than literally. Much research has been carried out on the automatic extraction of multi-word expressions, but no comprehensive work has been reported on idioms in Hindi. In this paper, an attempt has been made to study the linguistic and morphological variations that are usually encountered in idioms in Hindi. Based on this study, a methodology for deriving rules for the representation of idioms and for searching for them has been developed. The rules representing the idioms are hand-crafted. For idiom identification, the rule base is used to mark the input text for the probable presence of an idiom. Our system is limited to using only intra-sentential context. The experimental results demonstrate the feasibility and scalability of our methodology.
  • Keywords
    language translation; natural language processing; question answering (information retrieval); Hindi; NLP; idiom identification; information retrieval; machine translation; metaphorical sense; multiword expressions; question answering; Arrays; Context; Data mining; Databases; Semantics; Syntactics; idiom variations; idioms
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    2014 Seventh International Conference on Contemporary Computing (IC3)
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-5172-7
  • Type
    conf
  • DOI
    10.1109/IC3.2014.6897218
  • Filename
    6897218