DocumentCode :
2338730
Title :
Semantic labeling - unveiling the main components of meaning of free-text
Author :
Zieman, Yuri ; Salas, Ricardo
Author_Institution :
Unveil Technologies, Inc.
fYear :
2001
fDate :
13-15 Nov. 2001
Firstpage :
228
Lastpage :
235
Abstract :
An experimentally proven methodology for computing semantic labels for natural language and its use in semantic processing of text is described. A combinatorial model of the conceptual space is created where semantic labels result as combinations ofprimary or atomic concepts called Semantic Factors. The set of about 2,500 Semantic Factors is defined. The basic semantic element of a language is a morpheme-type element (s-morpheme), the minimalpart ofa language that bears its own meaning. All s-morphemes in the Knowledge Base (about 15,000 for English) are labeled. The label for a phrase (its ??Concept Codel7 results as a combination of the labels for the smorphemes constituting it. Algorithms are designed to identify the s-morphemes in a phrase and to generate the phrase??s Concept Code. The matching procedure compares Concept Codes and identifies conceptually close ones - those sharing a maximal number of Semantic Factors. Similarity is identified here as a match between the Concept Codes of two Text objects. Since a Concept Code is essentially language independent, this technology is appropriate for implementation in cross-language applications. An example is described of an application in the bio-medical domain, where documents of a database of more than 12 million titles are being successfully retrieved in about 50% of the queries normally rejected by traditional search methods.
Keywords :
Algorithm design and analysis; Appropriate technology; Databases; History; Information retrieval; Labeling; Natural languages; Search methods; Space technology; Thesauri;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
String Processing and Information Retrieval, 2001. SPIRE 2001. Proceedings.Eighth International Symposium on
Conference_Location :
Laguna de San Rafael, Chile
Print_ISBN :
0-7695-1192-9
Type :
conf
DOI :
10.1109/SPIRE.2001.989766
Filename :
989766
Link To Document :
بازگشت