Title :
Learning in imbalanced relational data
Author :
Ghanem, Amal S. ; Venkatesh, Svetha ; West, Geoff
Author_Institution :
Dept. of Comput., Curtin Univ. of Technol., Perth, WA, Australia
Abstract :
Traditional learning techniques learn from flat data files with the assumption that each class has a similar number of examples. However, the majority of real-world data are stored as relational systems with imbalanced data distribution, where one class of data is over-represented as compared with other classes. We propose to extend a relational learning technique called Probabilistic Relational Models (PRMs) to deal with the imbalanced class problem. We address learning from imbalanced relational data using an ensemble of PRMs and propose a new model: the PRMs-IM. We show the performance of PRMs-IM on a real university relational database to identify students at risk.
Keywords :
educational administrative data processing; learning (artificial intelligence); probability; relational databases; imbalanced class problem; imbalanced relational data; probabilistic relational model; relational learning technique; student identification; university relational database; Algorithm design and analysis; Bayesian methods; Costs; Decision trees; Inference algorithms; Laboratories; Probability distribution; Regression analysis; Relational databases; Voting;
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
DOI :
10.1109/ICPR.2008.4761095