DocumentCode
376247
Title
Does natural selection apply to natural language processing? An experiment for multiword unit extraction
Author
Dias, Gaël ; Nunes, Sérgio
Author_Institution
Center of Math., Beira Interior Univ., Covilha, Portugal
Volume
1
fYear
2001
fDate
2001
Firstpage
205
Abstract
In this paper, we focus on the suitability of natural selection for the extraction of Multiword Units (i.e., complex lexical units such as compound nouns, idiomatic expressions or phrase templates). For that purpose, we define a fitness function whose maximization, together with a similarity measure, serves as a basis for identifying pertinent word N-grams. To provide a suitable platform for evaluation, a software application called GALEMU (Genetic ALgorithm for the Extraction of Multiword Units) has been implemented. Finally, we report an experiment carried out on an unannotated text corpus extracted from the database collection of the European Commission, which yields results with a high precision rate.
Keywords
genetic algorithms; natural languages; GALEMU; fitness function; natural language processing; natural selection; similarity measures; Application software; Biological cells; Content based retrieval; Databases; Genetic algorithms; Humans; Indexing; Information retrieval; Mathematics; Natural language processing
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location
Tucson, AZ
ISSN
1062-922X
Print_ISBN
0-7803-7087-2
Type
conf
DOI
10.1109/ICSMC.2001.969813
Filename
969813