• DocumentCode
    593493
  • Title
    A survey on semantic similarity between words in semantic web
  • Author
    Ilakiya, P. ; Sumathi, M. ; Karthik, S.
  • Author_Institution
    Department of Comput. Sci. Eng., SNS Coll. of Technol., Coimbatore, India
  • fYear
    2012
  • fDate
    21-22 Dec. 2012
  • Firstpage
    213
  • Lastpage
    216
  • Abstract
    Measuring the semantic similarity between words is an important component in various tasks on the web such as relation extraction, community mining, document clustering, and automatic metadata extraction. Despite the usefulness of semantic similarity measures in these applications, accurately measuring semantic similarity between two words (or entities) remains a challenging task. This survey proposes an empirical method to estimate the semantic similarity of two words using page counts and text snippets retrieved from a web search engine. Specifically, this technique defines various word co-occurrence measures using page counts and integrates them with lexical patterns extracted from text snippets. To identify the numerous semantic relations that exist between two given words, a novel pattern extraction algorithm and a pattern clustering algorithm are proposed. The optimal combination of page count-based co-occurrence measures and lexical pattern clusters is learned using support vector machines.
  • Keywords
    pattern clustering; search engines; semantic Web; support vector machines; text analysis; Web search engine; automatic metadata extraction; community mining; document clustering; lexical pattern cluster; page counts-based cooccurrence measure; pattern clustering algorithm; pattern extraction algorithm; relation extraction; semantic relation; semantic similarity estimation; semantic similarity measure; semantic web; support vector machine; text snippet; word cooccurrence measure; Databases; Educational institutions; Search engines; Semantic Web; Semantics; Support vector machines; Web search; Page Count; Pattern Clustering; Relation Extraction; Semantic Similarity; Snippet;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Radar, Communication and Computing (ICRCC), 2012 International Conference on
  • Conference_Location
    Tiruvannamalai
  • Print_ISBN
    978-1-4673-2756-5
  • Type
    conf
  • DOI
    10.1109/ICRCC.2012.6450580
  • Filename
    6450580