• DocumentCode
    172499
  • Title

    Automatic detection of subject/object drops in Bengali

  • Author

    Das, Aruneema ; Garain, U. ; Senapati, Apurbalal

  • Author_Institution
    CVPR Unit, Indian Stat. Inst. Kolkata, Kolkata, India
  • fYear
    2014
  • fDate
    20-22 Oct. 2014
  • Firstpage
    91
  • Lastpage
    94
  • Abstract
    This paper presents a pioneering attempt at the automatic detection of drops in Bengali. The dominant drops in Bengali are subject, object and verb drops. Bengali is a pro-drop language, and pro-drops fall under the subject/object drops on which this research concentrates. The detection algorithm uses off-the-shelf Bengali NLP tools: a POS tagger, a chunker and a dependency parser. Simple linguistic rules are first applied to quickly annotate a dataset of 8,455 sentences, which is then manually checked. The corrected dataset is used to train two classifiers that label each sentence as either containing a drop or not. Features previously used by other researchers are considered. The two classifiers show comparable overall performance. As a by-product, the study produces another useful NLP resource besides the drop-annotated dataset: a classification of Bengali verbs (all morphological variants of 881 root verbs) by transitivity, which is in turn used as a feature by the classifiers.
  • Keywords
    grammars; linguistics; natural language processing; pattern classification; word processing; Bengali; POS tagger; automatic drops detection; chunker; dependency parser; linguistic rules; off-the-shelf Bengali NLP tools; pro-drop language; sentence classification; subject-object drops; verb drops; Bismuth; TV; Classification; Dependency parsing; POS tagging; Subject/Object drop
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Asian Language Processing (IALP), 2014 International Conference on
  • Conference_Location
    Kuching
  • Type

    conf

  • DOI
    10.1109/IALP.2014.6973488
  • Filename
    6973488