• DocumentCode
    431954
  • Title

    Space alternating data augmentation: application to finite mixture of Gaussians and speaker recognition

  • Author

    Doucet, Arnaud ; Sénécal, Stéphane ; Matsui, Tomoko

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    4
  • fYear
    2005
  • fDate
    18-23 March 2005
  • Abstract
    The SAGE (space-alternating generalized expectation-maximization) algorithm (Celeux, G. et al., 2001) is one of the most elegant and popular extensions of the EM (expectation maximization) algorithm for performing ML (maximum likelihood) or MAP (maximum a posteriori) parameter estimation. This algorithm updates parameter components by subblocks by alternating missing data spaces. Its efficiency has been reported in numerous simulation studies. We propose here an MCMC (Markov chain Monte Carlo) strategy named SADA (space-alternating data augmentation) which relies on the same principle in order to sample efficiently from (posterior) distributions and we discuss its application to finite mixtures of Gaussians. For this model, we also present an original implementation of the SAGE algorithm. In Monte Carlo simulations and in an application for speaker recognition, these methods, which are straightforward modifications of the standard EM and DA (data augmentation) algorithms, consistently outperform them.
  • Keywords
    Gaussian processes; Markov processes; Monte Carlo methods; maximum likelihood estimation; optimisation; signal processing; speaker recognition; MAP estimation; ML estimation; Markov chain Monte Carlo strategy; SAGE algorithm; finite Gaussian mixture; maximum a posteriori estimation; maximum likelihood estimation; parameter estimation; space-alternating data augmentation; space-alternating generalized expectation-maximization algorithm; speaker recognition; Bayesian methods; Convergence; Gaussian distribution; Gaussian processes; Inference algorithms; Maximum likelihood estimation; Monte Carlo methods; Parameter estimation; Sampling methods; Speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8874-7
  • Type

    conf

  • DOI
    10.1109/ICASSP.2005.1416108
  • Filename
    1416108