DocumentCode :
2474962
Title :
Learning in imbalanced relational data
Author :
Ghanem, Amal S. ; Venkatesh, Svetha ; West, Geoff
Author_Institution :
Dept. of Comput., Curtin Univ. of Technol., Perth, WA, Australia
fYear :
2008
fDate :
8-11 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
Traditional learning techniques learn from flat data files with the assumption that each class has a similar number of examples. However, the majority of real-world data are stored as relational systems with imbalanced data distribution, where one class of data is over-represented as compared with other classes. We propose to extend a relational learning technique called Probabilistic Relational Models (PRMs) to deal with the imbalanced class problem. We address learning from imbalanced relational data using an ensemble of PRMs and propose a new model: the PRMs-IM. We show the performance of PRMs-IM on a real university relational database to identify students at risk.
Keywords :
educational administrative data processing; learning (artificial intelligence); probability; relational databases; imbalanced class problem; imbalanced relational data; probabilistic relational model; relational learning technique; student identification; university relational database; Algorithm design and analysis; Bayesian methods; Costs; Decision trees; Inference algorithms; Laboratories; Probability distribution; Regression analysis; Relational databases; Voting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
ISSN :
1051-4651
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2008.4761095
Filename :
4761095
Link To Document :
بازگشت