DocumentCode :
3124331
Title :
A combined measure for text semantic similarity
Author :
Hao-Di Li ; Qing-Cai Chen ; Xiao-Long Wang
Author_Institution :
Shenzhen Grad. Sch., Intell. Comput. Res. Center, Harbin Inst. of Technol., Shenzhen, China
Volume :
04
fYear :
2013
fDate :
14-17 July 2013
Firstpage :
1869
Lastpage :
1873
Abstract :
With the rapid development of artificial intelligence and natural language processing, text similarity calculation has become the core module of many applications such as semantic disambiguation, information retrieval, automatic question answering and data mining etc. Most of the existing semantic similarity algorithms are based on statistical methods or rule based methods that are conducted on ontology dictionaries and some kind of knowledge bases. Wherein the rule-based methods usually use the dictionary, the ontology tree or graph, or the co-occurrence number of attributes, while the statistical methods may choose to use or not use a knowledge base. While a statistical method of using a knowledge base incorporates more comprehensive knowledge and has the capability of reduces knowledge noise, it usually obtains better performance. Nevertheless, due to the imbalanced distribution of different items in a knowledge base, the semantic similarity calculation results for low-frequency words are usually poor.
Keywords :
computational linguistics; statistical distributions; rule based methods; statistical methods; text semantic similarity; text similarity calculation; Abstracts; Correlation; Electronic publishing; Encyclopedias; Semantics; Statistical analysis; Combination of rule and statistical measure; Semantic similarity; Sentence level semantic similarity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2013 International Conference on
Conference_Location :
Tianjin
Type :
conf
DOI :
10.1109/ICMLC.2013.6890900
Filename :
6890900
Link To Document :
بازگشت