• DocumentCode
    388414
  • Title

    A comparison of learning techniques in speech recognition

  • Author

    Bradshaw, Gary L. ; Cole, Ron ; Li, Zongge

  • Author_Institution
    Carnegie-Mellon university, Pittsburgh, Pa.
  • Volume
    7
  • fYear
    1982
  • fDate
    30072
  • Firstpage
    554
  • Lastpage
    557
  • Abstract
    Template-based recognition systems overcome errors in the short-term matching process by comparing whole sequences of acoustic events. In many vocabularies, each word has a highly distinctive sequence. Some vocabularies have confusable words with very similar sequences, leading to poor recognition performance. Improvements in discriminability among similar words may be achieved by altering the matching algorithm, or by improving the reference template set. Both techniques are instances of multi-exemplar learning techniques which improve recognition performance through automatic evaluation of training data. This paper examines several such techniques using isolated utterances and highly ambiguous vocabularies (e.g., the "E" set; 3 B C D E G P V T Z) in a speaker-dependent recognition system. A system which combined both featural and template information led to the best performance for six out of eight speakers. Using this technique, E-set error rates improved from 37% to 10%.
  • Keywords
    Computer errors; Computer science; Data mining; Error analysis; Monitoring; Speech processing; Speech recognition; Training data; US Department of Defense; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1982.1171636
  • Filename
    1171636