DocumentCode :
1990539
Title :
A New Method of Text Categorization on Imbalanced Datasets
Author :
Xin-fu, Li ; Yan, Yu ; Peng, Yin
Author_Institution :
Coll. of Math. & Comput. Sci., Hebei Univ., Baoding
Volume :
2
fYear :
2008
fDate :
21-22 Dec. 2008
Firstpage :
259
Lastpage :
262
Abstract :
This paper aims at improving the categorization performance of the small number of samples in the imbalance datasets, and dealing with data re-sampling from the perspective of data. The main idea is to make the number of various types of texts by increasing some texts. The experiment indicates that the system has improved the accuracy of text-categorization effectively.
Keywords :
data handling; sampling methods; text analysis; data resampling; imbalanced datasets; text categorization; Computer science education; Educational institutions; Educational technology; Machine learning; Mathematics; Pattern recognition; Support vector machine classification; Support vector machines; Testing; Text categorization; SVM; imbalanced dataset; text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Education Technology and Training, 2008. and 2008 International Workshop on Geoscience and Remote Sensing. ETT and GRS 2008. International Workshop on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3563-0
Type :
conf
DOI :
10.1109/ETTandGRS.2008.42
Filename :
5070355
Link To Document :
بازگشت