• DocumentCode
    417134
  • Title

    Effects on transcription errors on supervised learning in speech recognition

  • Author

    Sundaram, Ram ; Picone, Joseph

  • Author_Institution
    Conversay, Redmond, WA, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Hidden Markov model-based speech recognition systems use supervised learning to train acoustic models. On difficult tasks such as conversational speech, there has been concern over the impact erroneous transcriptions have on the parameter estimation process. This work analyzes the effects of mislabeled data on recognition accuracy. Training is performed using manually corrupted transcriptions, and results are presented on three tasks: TIdigits, alphadigits and switchboard. For alphadigits, with 16% of the training data mislabeled, the performance of the system degrades by 12% relative to the baseline. On switchboard, at 16% mislabeled training data, the performance of the system degrades by 8.5% relative to the baseline. An analysis of these results revealed that the Gaussian mixture model contributes significantly to the robustness of the supervised learning training process.
  • Keywords
    Gaussian distribution; hidden Markov models; learning (artificial intelligence); parameter estimation; speech recognition; Gaussian mixture model; TIdigits; acoustic model training; alphadigits; conversational speech; corrupted transcriptions; hidden Markov models; multivariate Gaussian distribution; parameter estimation process; recognition accuracy mislabeled data effects; speech recognition; supervised learning training process robustness; supervised learning transcription error effects; switchboard; Databases; Degradation; Error analysis; Gaussian distribution; Impedance matching; Parameter estimation; Probability; Robustness; Speech recognition; Supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325949
  • Filename
    1325949