Title :
Parallel learning of large-scale multi-label classification problems with min-max modular LIBLINEAR
Author :
Chen, Yangyang ; Lu, Bao-Liang ; Zhao, Hai
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
Abstract :
The study on pattern classification trends to be towards large-scale, multi-label, and imbalanced problems. The amount of the data which need to be classified is typically dozens of millions and it keeps rapid increasing in recent years. Traditional pattern classification approaches are inefficient and even ineffective in this situation. In our previous work, we proposed a min-max modular (M3) network for dealing with large-scale and imbalanced problems. M3-network is a generalized modular learning framework and includes three main steps: decomposing a large-scale problem into several smaller independent sub-problems, learning these sub-problems in parallel, and combining the results of the sub-problems to generate a solution to the original problem. In this paper, we embed LIBLINEAR into M3-network (M3-liblnear) to deal with large-scale, multi-label, and imbanlanced pattern classification problems. LIBLINEAR is a fast implementation of a linear classifier. M3-Liblinear uses LIBLINEAR as a base classifier to learn each of the sub-problems. We compare M3-Liblinear with Liblinear-cdblock on a large-scale Japanese patent classification problem. Experimental results demonstrate that M3-Liblinear is superior to Liblinear-cdblock in both training time and generalization performance.
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); minimax techniques; pattern classification; Japanese patent classification problem; M3-liblnear; M3-network; generalization performance; generalized modular learning framework; imbalanced problem; large-scale problem decomposition; min-max modular liblinear; min-max modular network; multilabel classification problem; multilabel problem; parallel learning; pattern classification; training time; Accuracy; Indexes; Minimization; Patents; Training; Training data; Vectors; Liblinear-cdblock; Min-max modular network; imbalanced problem; multi-label problem;
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
DOI :
10.1109/IJCNN.2012.6252679