مرکز منطقه ای اطلاع رساني علوم و فناوري - Keyword extraction for text categorization

DocumentCode :

1947625

Title :

Keyword extraction for text categorization

Author :

An, Jiyuan ; Chen, Yi-Ping Phoebe

Author_Institution :

Sch. of Inf. Technol., Deakin Univ., Melbourne, Vic., Australia

fYear :

2005

fDate :

19-21 May 2005

Firstpage :

556

Lastpage :

561

Abstract :

Text categorization (TC) is one of the main applications of machine learning. Many methods have been proposed, such as Rocchio method, Naive bayes based method, and SVM based text classification method. These methods learn labeled text documents and then construct a classifier. A new coming text document´s category can be predicted. However, these methods do not give the description of each category. In the machine learning field, there are many concept learning algorithms, such as, ID3 and CN2. This paper proposes a more robust algorithm to induce concepts from training examples, which is based on enumeration of all possible keywords combinations. Experimental results show that the rules produced by our approach have more precision and simplicity than that of other methods.

Keywords :

Bayes methods; classification; learning (artificial intelligence); support vector machines; text analysis; vocabulary; CN2 learning algorithm; ID3 concept learning algorithm; Naive bayes method; Rocchio method; SVM; keywords extraction; machine learning; text classification; text document categorization; Australia Council; Bioinformatics; Data mining; Information technology; Machine learning; Machine learning algorithms; Robustness; Support vector machine classification; Support vector machines; Text categorization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Active Media Technology, 2005. (AMT 2005). Proceedings of the 2005 International Conference on

Print_ISBN :

0-7803-9035-0

Type :

conf

DOI :

10.1109/AMT.2005.1505422

Filename :

1505422

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1947625