• DocumentCode
    417279
  • Title

    Discriminative feature transformation by guided discriminative training

  • Author

    Hsiao, Roger ; Mak, Brian

  • Author_Institution
    Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., China
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    In this paper, we investigate guided discriminative training in the context of improving multi-class classification problems. We are interested in applications that require improvement in the classification performance of only a subset of the classes at the possible expense of poorer classification performance of the remaining classes. However, should the classification of the remaining classes deteriorate, it is guaranteed not to be worse than the extent that the user specifies. The problem is formulated as a nonlinear programming problem, which can be translated to a unconstrained nonlinear optimization problem using the barrier method that, in turn, can be solved by the gradient descent method. To prove the concept, we apply guided discriminative training to derive an optimal linear transformation on the mel-filterbank log power spectra to improve TIMIT phoneme classification. Encouraging results are obtained.
  • Keywords
    cepstral analysis; digital filters; feature extraction; gradient methods; nonlinear programming; pattern classification; speech processing; speech recognition; TIMIT phoneme classification; barrier method; discriminative feature transformation; gradient descent method; guided discriminative training; mel-filterbank log power spectra; multi-class classification problems; nonlinear programming problem; optimal linear transformation; speech recognition; unconstrained nonlinear optimization problem; Acoustic applications; Adaptation model; Application software; Computer errors; Computer science; Discrete cosine transforms; Feature extraction; Gas discharge devices; Linear discriminant analysis; Optimization methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326131
  • Filename
    1326131