مرکز منطقه ای اطلاع رساني علوم و فناوري - Efficient Multitemplate Learning for Structured Prediction

DocumentCode :

1763497

Title :

Efficient Multitemplate Learning for Structured Prediction

Author :

Qi Mao ; Tsang, Ivor Wai-Hung

Author_Institution :

Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore

Volume :

Issue :

fYear :

2013

fDate :

Feb. 2013

Firstpage :

248

Lastpage :

261

Abstract :

Conditional random fields (CRF) and structural support vector machines (structural SVM) are two state-of-the-art methods for structured prediction that captures the interdependencies among output variables. The success of these methods is attributed to the fact that their discriminative models are able to account for overlapping features on all input observations. These features are usually generated by applying a given set of templates on labeled data, but improper templates may lead to degraded performance. To alleviate this issue, in this paper we propose a novel multiple template learning paradigm to learn structured prediction and the importance of each template simultaneously, so that hundreds of arbitrary templates could be added into the learning model without caution. This paradigm can be formulated as a special multiple kernel learning problem with an exponential number of constraints. Then we introduce an efficient cutting-plane algorithm to solve this problem in the primal and present its convergence. We also evaluate the proposed learning paradigm on two widely studied structured prediction tasks, i.e., sequence labeling and dependency parsing. Extensive experimental results show that the proposed method outperforms CRFs and structural SVMs because of exploiting the importance of each template. Complexity analysis and empirical results also show that the proposed method is more efficient than Online multikernel learning on very sparse and high-dimensional data. We further extend this paradigm for structured prediction using generalized p-block norm regularization with p >; 1, and experiments show competitive performances when p ∈ [1,2).

Keywords :

convergence; data handling; learning (artificial intelligence); random processes; support vector machines; CRF; complexity analysis; conditional random field; convergence; cutting-plane algorithm; dependency parsing; generalized p-block norm regularization; labeled data template; learning model; multitemplate learning; online multikernel learning; sequence labeling; structural SVM; structural support vector machine; structured prediction; Convergence; Feature extraction; Kernel; Labeling; Prediction algorithms; Predictive models; Support vector machines; Dependency parsing; multiple template learning; sequence labeling; structured prediction;

fLanguage :

English

Journal_Title :

Neural Networks and Learning Systems, IEEE Transactions on

Publisher :

ieee

ISSN :

2162-237X

Type :

jour

DOI :

10.1109/TNNLS.2012.2228228

Filename :

6387742

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1763497