• DocumentCode
    599194
  • Title

    Building semantic corpus from wordNet

  • Author

    Stanchev, Lubomir

  • Author_Institution
    Indiana Univ. - Purdue Univ. Fort Wayne, Fort Wayne, IN, USA
  • fYear
    2012
  • fDate
    4-7 Oct. 2012
  • Firstpage
    226
  • Lastpage
    231
  • Abstract
    We propose a novel methodology for extracting semantic similarity knowledge from semi-structured sources, such as WordNet. Unlike existing approaches that only explore the structured information (e.g., the hypernym relationship in WordNet), we present a framework that allows us to utilize all available information, including natural language descriptions. Our approach constructs a semantic corpus. It is represented using a graph that models the relationship between phrases using numbers. The data in the semantic corpus can be used to measure the similarity between phrases, the similarity between documents, or to perform a semantic search in a set of documents that uses the meaning of words and phrases (i.e., search that is not keyword-based).
  • Keywords
    feature extraction; graph theory; natural language processing; WordNet; graph; natural language descriptions; semantic corpus; semantic search; semantic similarity knowledge extraction; semistructured sources; Bayesian methods; Correlation; Frequency conversion; Humans; Natural languages; Semantics; Wheelchairs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    978-1-4673-2746-6
  • Electronic_ISBN
    978-1-4673-2744-2
  • Type

    conf

  • DOI
    10.1109/BIBMW.2012.6470308
  • Filename
    6470308