DocumentCode
599194
Title
Building semantic corpus from wordNet
Author
Stanchev, Lubomir
Author_Institution
Indiana Univ. - Purdue Univ. Fort Wayne, Fort Wayne, IN, USA
fYear
2012
fDate
4-7 Oct. 2012
Firstpage
226
Lastpage
231
Abstract
We propose a novel methodology for extracting semantic similarity knowledge from semi-structured sources, such as WordNet. Unlike existing approaches that only explore the structured information (e.g., the hypernym relationship in WordNet), we present a framework that allows us to utilize all available information, including natural language descriptions. Our approach constructs a semantic corpus. It is represented using a graph that models the relationship between phrases using numbers. The data in the semantic corpus can be used to measure the similarity between phrases, the similarity between documents, or to perform a semantic search in a set of documents that uses the meaning of words and phrases (i.e., search that is not keyword-based).
Keywords
feature extraction; graph theory; natural language processing; WordNet; graph; natural language descriptions; semantic corpus; semantic search; semantic similarity knowledge extraction; semistructured sources; Bayesian methods; Correlation; Frequency conversion; Humans; Natural languages; Semantics; Wheelchairs;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
978-1-4673-2746-6
Electronic_ISBN
978-1-4673-2744-2
Type
conf
DOI
10.1109/BIBMW.2012.6470308
Filename
6470308
Link To Document