• DocumentCode
    395221
  • Title

    Fast discriminative training for sequential observations with application to speaker identification

  • Author

    Li, Qi ; Juang, Büng-Hwang

  • Volume
    2
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper presents a fast discriminative training algorithm for sequences of observations. It considers a sequence of feature vectors as one single composite token in training or testing. In contrast to the traditional EM algorithm, this algorithm is derived from a discriminative objective, aiming at directly minimizing the recognition error. Compared to the gradient-descent algorithms for discriminative training, this algorithm invokes a mild assumption which leads to closed-form formulas for re-estimation, rather than relying on gradient search, without sacrificing the algorithmic rigor. As such, it is in general much faster than a descent based algorithm and does not need to determine the learning rate or step size. Our experiment shows that the proposed algorithm reduces error rate by 14.65, 66.46, and 100.00% for 1, 5, and 10 seconds of testing data respectively, in a speaker identification application.
  • Keywords
    error statistics; feature extraction; minimisation; sequential estimation; speaker recognition; closed-form formulas; composite token; discriminative training algorithm; error rate; feature vectors; re-estimation; recognition error minimization; sequential observations; speaker identification; Decision theory; Error analysis; Error correction; Iterative algorithms; Maximum likelihood estimation; Pattern recognition; Security; Speech recognition; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1202333
  • Filename
    1202333