• DocumentCode
    3540200
  • Title

    Active learning for large-scale factor analysis

  • Author

    Silva, Jorge ; Carin, Lawrence

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Duke Univ., Durham, NC, USA
  • fYear
    2012
  • fDate
    5-8 Aug. 2012
  • Firstpage
    161
  • Lastpage
    164
  • Abstract
    A method for Bayesian factor analysis (FA) of large matrices is proposed. It is assumed that a small number of matrix elements are initially observed, and the statistical FA model is employed to actively and sequentially select which new matrix entries would be most informative, in order to estimate the remaining missing entries, i.e., complete the matrix. The model inference and active learning are performed within an online variational Bayes (VB) framework. A fast and provably near-optimal greedy algorithm is used to sequentially maximize the mutual information contribution from new observations, taking advantage of submodularity properties. Additionally, a simple alternative procedure is proposed, in which the posterior parameters learned by the Bayesian approach are directly used. This alternative procedure is shown to achieve slightly higher prediction error, but requires much fewer computational resources. The methods are demonstrated on a very large matrix factorization problem, namely the Yahoo! Music ratings dataset.
  • Keywords
    belief networks; greedy algorithms; learning (artificial intelligence); matrix decomposition; music; statistical analysis; Bayesian approach; Bayesian factor analysis; Yahoo!; active learning; computational resources; large-scale factor analysis; matrix factorization problem; music ratings dataset; mutual information contribution; near-optimal greedy algorithm; online variational Bayes framework; posterior parameters; statistical FA model; Approximation algorithms; Bayesian methods; Collaboration; Computational modeling; Greedy algorithms; Kernel; Mutual information; Online learning; matrix factorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Statistical Signal Processing Workshop (SSP), 2012 IEEE
  • Conference_Location
    Ann Arbor, MI
  • ISSN
    pending
  • Print_ISBN
    978-1-4673-0182-4
  • Electronic_ISBN
    pending
  • Type

    conf

  • DOI
    10.1109/SSP.2012.6319648
  • Filename
    6319648