• DocumentCode
    323580
  • Title

    Comparison of discriminative training criteria

  • Author

    Schlüter, Ralf ; Macherey, W.

  • Author_Institution
    Lehrstuhl fur Inf. VI, Tech. Hochschule Aachen, Germany
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    493
  • Abstract
    A formally unifying approach for a class of discriminative training criteria including maximum mutual information (MMI) and minimum classification error (MCE) criterion is presented, together with the optimization methods of the gradient descent (GD) and extended Baum-Welch (EB) algorithm. Comparisons are discussed for the MMI and the MCE criterion, including the determination of the sets of word sequence hypotheses for discrimination using word graphs. Experiments have been carried out on the SieTill corpus for telephone line recorded German continuous digit strings. Using several approaches for acoustic modeling, the word error rates obtained by MMI training using single densities always were better than those for maximum likelihood (ML) using mixture densities. Finally, the results obtained for corrective training (CT), i.e. using only the best recognized word sequence in addition to the spoken word sequence, could not be improved by using the word graph based discriminative training
  • Keywords
    acoustic signal processing; error statistics; information theory; optimisation; pattern classification; speech recognition; German continuous digit strings; MMI training; SieTill corpus; acoustic modeling; corrective training; discriminative training criteria; extended Baum-Welch algorithm; gradient descent; maximum mutual information; minimum classification error; mixture densities; optimization methods; parameter optimisation; recognized word sequence; spoken word sequence; telephone line recorded digit strings; word error rates; word graphs; word sequence hypotheses; Acoustic emission; Convergence; Error analysis; Linear discriminant analysis; Mutual information; Optimization methods; Smoothing methods; Telephony; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674475
  • Filename
    674475