Semantic distance acquisition in SemaCS

Author

Sjachyn, Maxym ; Beus-Dukic, Ljerka

Author_Institution

School of Electronics and Computer Science, University of Westminster, London, United Kingdom

fYear

2010

fDate

19-21 May 2010

Firstpage

183

Lastpage

190

Abstract

Search functionality and technology is a growing area of research. However, simple search approaches are still frequently used. A simple keyword or thesauri-based search is efficient and can be easily scaled. However, keyword-based search cannot be used to infer what may or may not be relevant to the user and thesauri, or any other expert generated model, is expensive to produce and tends to be of limited applicability. Semantic Component Selection (SemaCS) approach is not tied to any specific domain and does not rely on expert input. SemaCS is based on actual data and statistical semantic distances between words. Information on semantic distances is used for searching and for automated generation of domain model taxonomy. This paper presents SemaCS´s means of acquiring these semantic distances - mNGD (2) - and its initial evaluation.

Keywords

Artificial intelligence; Competitive intelligence; Computer science; Dictionaries; Indexing; Ontologies; Search engines; Taxonomy; Thesauri; World Wide Web; clustering measure of similarity; indexing; knowledge discovery; semantic distance; taxonomy generation;

fLanguage

English

Publisher

ieee

Conference_Titel

Research Challenges in Information Science (RCIS), 2010 Fourth International Conference on

Conference_Location

Nice, France

ISSN

2151-1349

Print_ISBN

978-1-4244-4839-5

Electronic_ISBN

2151-1349

Type

conf

DOI

10.1109/RCIS.2010.5507378

Filename

5507378