DocumentCode :
3302191
Title :
Machine Learning for Keyphrases Extraction Based on Naive Bayesian Classifier
Author :
Wang, Jiabing ; Peng, Hong ; Hu, Jingsong
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
Volume :
1
fYear :
2006
fDate :
Nov. 2006
Firstpage :
815
Lastpage :
818
Abstract :
Keyphrase extraction is a task with many applications in information retrieval, text mining, and natural language processing. In this paper, a keyphrase extraction approach based on the naive Bayesian classifier is proposed. To determine whether a phrase is a keyphrase, the following features of a phrase in a given document are adopted: its term frequency, whether to appear in the title, abstract and headings (subheadings), and its frequency appearing in the paragraphs of the given document. The approach is evaluated by the standard information retrieval metrics of precision and recall. Experiment results show that this approach is very practical: it can achieve high precision and recall; especially the recall it can achieve is over 80 percent
Keywords :
Bayes methods; information retrieval; learning (artificial intelligence); pattern classification; information retrieval metrics; keyphrases extraction; machine learning; naive Bayesian classifier; Application software; Bayesian methods; Computer science; Data mining; Frequency; Information retrieval; Machine learning; Machine learning algorithms; Natural language processing; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Security, 2006 International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
1-4244-0605-6
Electronic_ISBN :
1-4244-0605-6
Type :
conf
DOI :
10.1109/ICCIAS.2006.294249
Filename :
4072202
Link To Document :
بازگشت