Title :
Machine Learning for Keyphrases Extraction Based on Naive Bayesian Classifier
Author :
Wang, Jiabing ; Peng, Hong ; Hu, Jingsong
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
Abstract :
Keyphrase extraction is a task with many applications in information retrieval, text mining, and natural language processing. In this paper, a keyphrase extraction approach based on the naive Bayesian classifier is proposed. To determine whether a phrase is a keyphrase, the following features of a phrase in a given document are adopted: its term frequency, whether to appear in the title, abstract and headings (subheadings), and its frequency appearing in the paragraphs of the given document. The approach is evaluated by the standard information retrieval metrics of precision and recall. Experiment results show that this approach is very practical: it can achieve high precision and recall; especially the recall it can achieve is over 80 percent
Keywords :
Bayes methods; information retrieval; learning (artificial intelligence); pattern classification; information retrieval metrics; keyphrases extraction; machine learning; naive Bayesian classifier; Application software; Bayesian methods; Computer science; Data mining; Frequency; Information retrieval; Machine learning; Machine learning algorithms; Natural language processing; Text mining;
Conference_Titel :
Computational Intelligence and Security, 2006 International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
1-4244-0605-6
Electronic_ISBN :
1-4244-0605-6
DOI :
10.1109/ICCIAS.2006.294249