• DocumentCode
    2775233
  • Title

    Parallel learning of large-scale multi-label classification problems with min-max modular LIBLINEAR

  • Author

    Chen, Yangyang ; Lu, Bao-Liang ; Zhao, Hai

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    The study on pattern classification trends to be towards large-scale, multi-label, and imbalanced problems. The amount of the data which need to be classified is typically dozens of millions and it keeps rapid increasing in recent years. Traditional pattern classification approaches are inefficient and even ineffective in this situation. In our previous work, we proposed a min-max modular (M3) network for dealing with large-scale and imbalanced problems. M3-network is a generalized modular learning framework and includes three main steps: decomposing a large-scale problem into several smaller independent sub-problems, learning these sub-problems in parallel, and combining the results of the sub-problems to generate a solution to the original problem. In this paper, we embed LIBLINEAR into M3-network (M3-liblnear) to deal with large-scale, multi-label, and imbanlanced pattern classification problems. LIBLINEAR is a fast implementation of a linear classifier. M3-Liblinear uses LIBLINEAR as a base classifier to learn each of the sub-problems. We compare M3-Liblinear with Liblinear-cdblock on a large-scale Japanese patent classification problem. Experimental results demonstrate that M3-Liblinear is superior to Liblinear-cdblock in both training time and generalization performance.
  • Keywords
    generalisation (artificial intelligence); learning (artificial intelligence); minimax techniques; pattern classification; Japanese patent classification problem; M3-liblnear; M3-network; generalization performance; generalized modular learning framework; imbalanced problem; large-scale problem decomposition; min-max modular liblinear; min-max modular network; multilabel classification problem; multilabel problem; parallel learning; pattern classification; training time; Accuracy; Indexes; Minimization; Patents; Training; Training data; Vectors; Liblinear-cdblock; Min-max modular network; imbalanced problem; multi-label problem;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2012 International Joint Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-1488-6
  • Electronic_ISBN
    2161-4393
  • Type

    conf

  • DOI
    10.1109/IJCNN.2012.6252679
  • Filename
    6252679