• DocumentCode
    2348874
  • Title

    Combining classifiers for supertagging Arabic texts

  • Author

    Ben Othmane Zribi, Chiraz ; Ben Fraj, Feriel ; Ben Ahmed, Mohamed

  • Author_Institution
    RIADI-GDL Lab., ENSI, La Manouba, Tunisia
  • fYear
    2010
  • fDate
    21-23 Aug. 2010
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    This paper deals with supertagging Arabic texts with ArabTAG formalism, a semi-lexicalised grammar based on TAG and adapted for Arabic. Supertagging is a very useful task because it reduces and speeds the work of parsing. We view this problem as a classification task where elementary structures supertags (classes) are affected to words in a given sentence according to their description (morpho-syntactic and contextual information). We propose to combine three classifiers: Naïve Bayes, k-Nearest Neighbors (k-NN) and Decision tree by a voting procedure. The primary results were satisfactory as we obtained an accuracy rate of 76% although the small size of our training corpus (5,000 words) and the difficulties related to Arabic language specificities.
  • Keywords
    Bayes methods; decision trees; learning (artificial intelligence); natural language processing; pattern classification; text analysis; ArabTAG formalism; Arabic text supertagging; Naïve Bayes classifiers; contextual information; decision tree classifiers; elementary structures supertags; k-nearest neighbors classifiers; morphosyntactic description; semilexicalised grammar; Grammar; ArabTAG; Arabic language; Supertagging; classification; ensemble learning; machine learning; tree adjoining grammar;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-6896-6
  • Type

    conf

  • DOI
    10.1109/NLPKE.2010.5587841
  • Filename
    5587841