Title :
Improving Text Classifier Performance based on AUC
Author :
Wong, Alex K S ; Lee, John W T ; Yeung, Daniel S.
Author_Institution :
Dept. of Comput., The Hong Kong Polytech. Univ.,
Abstract :
To evaluate the performance of text classifiers, we usually look at measures related to precision and recall, and most machine learning methods are optimized for these measures. In recent year, the use of receiver operating characteristics (ROC) graph and its extension area under the ROC curve (AUC) in gauging classifier performance has attracted much attention from the machine learning community. This measure is especially useful when a data set is imbalanced or when operating characteristics are unknown. Some researchers have started investigating the optimization of existing learning model for this new performance criterion. In this paper, we proposed modifications to the well-known weight updating text classifier sleeping-experts (SE) for AUC optimization. Our experiments show that through our new sampling and updating strategy we can improve the classifier both in terms of AUC and the traditional performance measures
Keywords :
graph theory; learning (artificial intelligence); optimisation; pattern classification; performance evaluation; text analysis; AUC optimization; machine learning; receiver operating characteristics graph; sleeping-experts; text classifier performance evaluation; Area measurement; Decision trees; Learning systems; Machine learning; Optimization methods; Particle measurements; Performance evaluation; Sampling methods; Support vector machines; Text categorization;
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.705