• DocumentCode
    20840
  • Title

    Latent Structured Perceptrons for Large-Scale Learning with Hidden Information

  • Author

    Xu Sun ; Matsuzaki, Takaomi ; Wenjie Li

  • Author_Institution
    Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
  • Volume
    25
  • Issue
    9
  • fYear
    2013
  • fDate
    Sept. 2013
  • Firstpage
    2063
  • Lastpage
    2075
  • Abstract
    Many real-world data mining problems contain hidden information (e.g., unobservable latent dependencies). We propose a perceptron-style method, latent structured perceptron, for fast discriminative learning of structured classification with hidden information. We also give theoretical analysis and demonstrate good convergence properties of the proposed method. Our method extends the perceptron algorithm for the learning task with hidden information, which can be hardly captured by traditional models. It relies on Viterbi decoding over latent variables, combined with simple additive updates. We perform experiments on one synthetic data set and two real-world structured classification tasks. Compared to conventional nonlatent models (e.g., conditional random fields, structured perceptrons), our method is more accurate on real-world tasks. Compared to existing heavy probabilistic models of latent variables (e.g., latent conditional random fields), our method lowers the training cost significantly (almost one order magnitude faster) yet with comparable or even superior classification accuracy. In addition, experiments demonstrate that the proposed method has good scalability on large-scale problems.
  • Keywords
    Viterbi decoding; data mining; learning (artificial intelligence); pattern classification; probability; Viterbi decoding; convergence properties; data mining problems; discriminative learning task; heavy probabilistic models; hidden information; large-scale learning; latent structured perceptrons; latent variables; nonlatent models; perceptron algorithm; perceptron-style method; real-world structured classification tasks; simple additive updates; synthetic data set; training cost; Computational modeling; Convergence; Estimation; Hidden Markov models; Training; Vectors; Viterbi algorithm; Structured perceptron; convergence analysis; hidden information; large-scale learning; latent variable;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.129
  • Filename
    6226404