Building semantic corpus from wordNet

Author

Stanchev, Lubomir

Author_Institution

Indiana Univ. - Purdue Univ. Fort Wayne, Fort Wayne, IN, USA

fYear

2012

fDate

4-7 Oct. 2012

Firstpage

226

Lastpage

231

Abstract

We propose a novel methodology for extracting semantic similarity knowledge from semi-structured sources, such as WordNet. Unlike existing approaches that only explore the structured information (e.g., the hypernym relationship in WordNet), we present a framework that allows us to utilize all available information, including natural language descriptions. Our approach constructs a semantic corpus. It is represented using a graph that models the relationship between phrases using numbers. The data in the semantic corpus can be used to measure the similarity between phrases, the similarity between documents, or to perform a semantic search in a set of documents that uses the meaning of words and phrases (i.e., search that is not keyword-based).

Keywords

feature extraction; graph theory; natural language processing; WordNet; graph; natural language descriptions; semantic corpus; semantic search; semantic similarity knowledge extraction; semistructured sources; Bayesian methods; Correlation; Frequency conversion; Humans; Natural languages; Semantics; Wheelchairs;

fLanguage

English

Publisher

ieee

Conference_Titel

Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on

Conference_Location

Philadelphia, PA

Print_ISBN

978-1-4673-2746-6

Electronic_ISBN

978-1-4673-2744-2

Type

conf

DOI

10.1109/BIBMW.2012.6470308

Filename

6470308