• DocumentCode
    1679699
  • Title

    Multiobjective Approach for Feature Selection in Maximum Entropy Based Named Entity Recognition

  • Author

    Ekbal, Asif ; Saha, Sriparna ; Hasanuzzaman, Md

  • Author_Institution
    Univ. of Trento, Trento, Italy
  • Volume
    1
  • fYear
    2010
  • Firstpage
    323
  • Lastpage
    326
  • Abstract
    In this paper, we present the problem of appropriate feature selection for constructing a Maximum Entropy (ME) based Named Entity Recognition (NER) system under the multiobjective optimization (MOO) framework. Two conflicting objective functions are simultaneously optimized using the search capability of MOO. These objectives are (i). the dimensionality of features, which is tried to be minimized, and (ii). the corresponding F-measure value of the classifier, trained using the features present, is maximized. The features are encoded in the chromosomes. Thereafter, a multiobjective evolutionary algorithm in the steps of a popular MOO technique, NSGA-II, is developed to determine the appropriate feature subset. The proposed technique is evaluated to determine the suitable feature combinations for NER in a resource-constrained language, namely Bengali. Evaluation results yield the recall, precision and F-measure values of 72.45%, 82.39% and 77.11%, respectively.
  • Keywords
    maximum entropy methods; natural language processing; optimisation; pattern classification; Bengali; F-measure value; chromosome; classifier; feature selection; maximum entropy; multiobjective evolutionary algorithm; multiobjective optimization; named entity recognition; resource-constrained language; search capability; Biological cells; Context; Gallium; Optimization; Support vector machines; Training; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2010 22nd IEEE International Conference on
  • Conference_Location
    Arras
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4244-8817-9
  • Type

    conf

  • DOI
    10.1109/ICTAI.2010.54
  • Filename
    5670053