• DocumentCode
    3222564
  • Title

    Using text corpora for understanding polysemy in Bangla

  • Author

    Dash, Niladri Sekhar ; Chaudhuri, Bidyut Baran

  • Author_Institution
    Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Calcutta, India
  • fYear
    2002
  • fDate
    13-15 Dec. 2002
  • Firstpage
    99
  • Lastpage
    109
  • Abstract
    Polysemy implies the presence of more than one sense of a particular word both in its context-bound and context-free situation. The inherent aspect of polysemy is that a particular word will show multiple sense variations related by way of semantic extension and conceptual expansion. In the last fifty years or so, polysemy has been recognised as one of the central issues in lexical semantics, word sense disambiguation (WSD), actual sense extraction (ASE), language learning, conceptual categorisation of words as well as in computer processing of language. Language users can identify a polysemous word quite easily, but are not equipped to decipher all its possible sense variations without appropriate reference to a proper knowledge-base and other relevant information embedded within contextual environments. We make an empirical effort to understand the basic nature of polysemy in Bangla. We also intend to know how words denote sense variations, which factors are instrumental in making them polysemous, what impact they have on language understanding, and how sense variation can be best understood using information from various sources of knowledge bases. Finally, we use a method to understand the role of various contexts, obtained from corpora, in maintaining an interface between words and their sense variations. Quick reference to local context is handy at times, but reference to focal, topical and global contexts as well as an extralinguistic knowledge base is necessary for understanding sense variation and obtaining actual contextual sense.
  • Keywords
    computational linguistics; natural languages; Bangla language; actual sense extraction; computer processing; conceptual categorisation; conceptual expansion; context-bound situation; context-free situation; extralinguistic knowledge base; focal context; global context; language learning; language understanding; lexical semantics; local context; multiple sense variations; polysemy; semantic extension; text corpora; topical context; word; word sense disambiguation; Artificial intelligence; Cognition; Computational linguistics; Computer vision; Data mining; Humans; Information analysis; Instruments; Pattern recognition; Psychology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Language Engineering Conference, 2002. Proceedings
  • Print_ISBN
    0-7695-1885-0
  • Type

    conf

  • DOI
    10.1109/LEC.2002.1182297
  • Filename
    1182297