Title :
A Novel Approach to Find Semantic Similarity Measure between Words
Author :
Sahni, Lakshay ; Sehgal, Anubhav ; Kochar, Shaivi ; Ahmad, Faiyaz ; Ahmad, Tanvir
Author_Institution :
Dept. of Comput. Eng., Jamia Millia Islamia, New Delhi, India
Abstract :
Measuring the semantic similarity between words is a significant feature of the web mining domain. The notion of semantic similarity finds applications in various horizons such as relation extraction, community mining, document clustering, and automatic meta data extraction. This paper introduces a method for measuring the semantic similarity of English words. It combines web search engine based similarity measures namely page counts and probability measure from text snippets with the lexical taxonomy based measures of similarity. The adopted measures are employed and learned using support vector machines. The proposed method is successful in achieving a competent accuracy for the said purpose.
Keywords :
Internet; data mining; natural language processing; search engines; support vector machines; word processing; English words; Web mining domain; Web search engine based similarity measures; automatic meta data extraction; community mining; document clustering; lexical taxonomy based measures; page counts; probability measure; relation extraction; semantic similarity measure; support vector machines; text snippets; Engines; Q measurement; Search engines; Semantics; Support vector machines; Taxonomy; Web search; Page Counts; Probability; Web Mining; WordNet;
Conference_Titel :
Computational and Business Intelligence (ISCBI), 2014 2nd International Symposium on
Print_ISBN :
978-1-4799-7551-8
DOI :
10.1109/ISCBI.2014.26