DocumentCode :
2665475
Title :
A Neural Model for Unsupervised Named Entity Classification
Author :
Chifu, Emil St ; Chifu, Emil
Author_Institution :
Dept. of Comput. Sci., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear :
2008
fDate :
10-12 Dec. 2008
Firstpage :
1077
Lastpage :
1082
Abstract :
The paper describes an unsupervised model for named entity classification into a large number of classes specified by an ontology. The framework is based on an extended model of hierarchical self-organizing maps. As being founded on an unsupervised neural network architecture, the framework can be applied to different languages and domains. Named entities extracted by mining a domain text corpus encode contextual content information, in a distributional vector space. The classification of the extracted named entities into the taxonomy of the given ontology proceeds by associating every named entity to one node of the taxonomy. We experimented the model in the "Lonely Planet" tourism domain. The taxonomy, the corpus, and the named entities to classify are the ones proposed in the PASCAL ontology learning and population challenge.
Keywords :
ontologies (artificial intelligence); pattern classification; self-organising feature maps; unsupervised learning; Lonely Planet tourism domain; PASCAL ontology learning; distributional vector space; domain text corpus mining; hierarchical self-organizing map; named entity classification; population challenge; taxonomy; unsupervised model; unsupervised neural network architecture; Computer architecture; Computer science; Data mining; Frequency; Neural networks; Ontologies; Pattern recognition; Planets; Self organizing feature maps; Taxonomy; centroid vector; document category histograms; extended growing hierarchical self-organizing maps; taxonomy enrichment; unsupervised neural network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence for Modelling Control & Automation, 2008 International Conference on
Conference_Location :
Vienna
Print_ISBN :
978-0-7695-3514-2
Type :
conf
DOI :
10.1109/CIMCA.2008.163
Filename :
5172775
Link To Document :
بازگشت