Title :
Hybrid Generative/Discriminative Approaches for Proportional Data Modeling and Classification
Author_Institution :
Concordia Inst. for Inf. Syst. Eng. (CIISE), Concordia Univ., Montreal, QC, Canada
Abstract :
The work proposed in this paper is motivated by the need to develop powerful models and approaches to classify and learn proportional data. Indeed, an abundance of interesting data in several applications occur naturally in this form. Our goal is to discover and capture the intrinsic nature of the data by proposing some approaches that combine the major advantages of generative models namely finite mixtures and discriminative techniques namely support vector machines (SVMs). Indeed, SVMs often rely on classic kernels which are not generally meaningful for proportional data. One serious limitation of these kernels is that they do not take into account the nature of data to classify and choosing a suitable kernel continues to be a formidable challenge for data mining and machine learning researchers. Our approach builds on selecting accurate kernels generated from finite mixtures of Dirichlet, generalized Dirichlet and Beta-Liouville distributions which chief advantage is their flexibility and explanatory capabilities in the case of heterogenous proportional data. Using extensive simulations and a number of experiments involving scene modeling and classification, and automatic image orientation detection, we show the merits of the proposed mixture models and the accuracy of the generated kernels.
Keywords :
data mining; image classification; learning (artificial intelligence); object detection; statistical distributions; support vector machines; Beta-Liouville distributions; SVM; automatic image orientation detection; classic kernels; data mining; discriminative techniques; generalized Dirichlet finite mixtures; hybrid generative-discriminative approach; kernel selection; machine learning researche; proportional data classification; proportional data learning; proportional data modeling; scene classification; scene modeling; support vector machines; Data models; Hidden Markov models; Machine learning; Support vector machine classification; Dirichlet; Generative/discriminative learning; Liouville; SVMs; finite mixture models; generalized Dirichlet; image orientation; kernels; model selection; proportional data; scene classification;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2011.162