DocumentCode :
2892354
Title :
Multilevel Regression Models for Learning in the Presence of Rare Data
Author :
Ravindran, Sriram ; Bahler, D.
Author_Institution :
Dept. of Comput. Sci., North Carolina State Univ., Raleigh, NC, USA
Volume :
2
fYear :
2012
fDate :
12-15 Dec. 2012
Firstpage :
107
Lastpage :
112
Abstract :
Learning with imbalanced datasets has been a major topic of study for many years. In this paper, we focus on a type of imbalance called imbalance due to rare instances. Such imbalances occur in a variety of domains. Rare instances have received less focus in prediction problems and we wish to draw attention to how accuracy can be improved in the presence of rare data. We discuss an approach to regression tasks, where the training instances are first grouped by similarity and a group of heterogeneous models is applied to each of these groups. This approach enables better prediction on unseen or rare instances when compared to existing approaches. We present results that show performance across datasets from different domains. Our approach was found to provide better prediction than common approaches on rare unseen instances without affecting the overall performance on prediction tasks.
Keywords :
data handling; learning (artificial intelligence); regression analysis; machine learning; multilevel regression models; rare data presence; regression tasks; Accuracy; Computational modeling; Data models; Machine learning; Predictive models; Regression tree analysis; Training; Multilevel model; rare examples; regression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
Type :
conf
DOI :
10.1109/ICMLA.2012.207
Filename :
6406736
Link To Document :
بازگشت