DocumentCode :
244915
Title :
Neural Conditional Energy Models for Multi-label Classification
Author :
How Jing ; Shou-De Lin
Author_Institution :
Dept. of Comput. Sci., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
2014
fDate :
14-17 Dec. 2014
Firstpage :
240
Lastpage :
249
Abstract :
Multi-label classification (MLC) is a type of structured output prediction problems where a given instance can be associated to more than one labels at a time. From the probabilistic point of view, a model predicts a set of labels y given an input vector v by learning a conditional distribution p(y|v). This paper presents a powerful model called a Neural Conditional Energy Model (NCEM) to solve MLC. The model can be viewed as a hybrid deterministic-stochastic network of which we use a deterministic neural network to transform the input data, before contributing to the energy landscape of v, y, and a single stochastic hidden layer h. Non-linear transformation given by the neural network makes our model more expressive and more capable of capturing complex relations between input and output, and using deterministic neurons facilitates exact inference. We present an efficient learning algorithm that is simple to implement. We conduct extensive experiments on 15 real-world datasets from wide variety of domains with various evaluation metrics to confirm that NCEM is significantly superior to current state-of-the-art models most of the time based on pair-wise t-test at 5% significance level. The MATLAB source code to replicate our experiments are available at https://github.com/Kublai-Jing/NCEM.
Keywords :
learning (artificial intelligence); mathematics computing; neural nets; pattern classification; probability; stochastic processes; vectors; MLC; Matlab source code; NCEM; conditional distribution; deterministic neural network; multilabel classification; neural conditional energy models; nonlinear transformation; probabilistic point of view; stochastic hidden layer; vector; Computational modeling; Correlation; Data models; Mathematical model; Stochastic processes; Training; Vectors; Multi-Label Classification; Probabilistic Modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
ISSN :
1550-4786
Print_ISBN :
978-1-4799-4303-6
Type :
conf
DOI :
10.1109/ICDM.2014.39
Filename :
7023341
Link To Document :
بازگشت