• DocumentCode
    376247
  • Title

    Does natural selection apply to natural language processing? an experiment for multiword unit extraction

  • Author

    Dias, Gaël ; Nunes, Sérgio

  • Author_Institution
    Center of Math., Beira Interior Univ., Covilha, Portugal
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    205
  • Abstract
    In this paper, we focus on the suitability of natural selection for the extraction of Multiword Units (i.e. complex lexical units such as compound nouns, idiomatic expressions or phrase templates). For that purpose, a fitness function is defined whose maximization serves as a basis for the identification of pertinent word N-grams together with a similarity measure. In order to propose a suitable platform for evaluation, a software application called GALEMU (Genetic ALgorithm for the Extraction of Multiword Units) has been implemented. Finally, we will provide an experiment realized over an unnnotated text corpus extracted from the database collection of the European Commission that evidences results with high precision rate
  • Keywords
    genetic algorithms; natural languages; GALEMU; fitness function; natural language processing; natural selection; similarity measures; Application software; Biological cells; Content based retrieval; Databases; Genetic algorithms; Humans; Indexing; Information retrieval; Mathematics; Natural language processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 2001 IEEE International Conference on
  • Conference_Location
    Tucson, AZ
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7087-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2001.969813
  • Filename
    969813