• DocumentCode
    311018
  • Title

    Microsegment-based connected digit recognition

  • Author

    Godfrey, J.J. ; Ganapathiraju, Aravind ; Ramalingam, C.S. ; Picone, Joseph

  • Author_Institution
    Texas Instrum. Inc., Dallas, TX, USA
  • Volume
    3
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1755
  • Abstract
    By building acoustic phonetic models which explicitly represent as much knowledge of pronunciation in a small domain (the digits) as possible, we can create a recognition system which not only performs well but allows for meaningful error analysis and improvement. An HMM-based recognizer for the digits and a few associated words was constructed in accord with these principles. About 65 phonetic models were trained on 140 carefully labeled utterances, then iteratively trained on unlabeled data under orthographic supervision. The basic system achieved less than 3% word error rate on digit strings of unknown length from unseen test speakers, and 1.4% on 7-digit strings of known length. This is competitive with word-based models using the same HMM engine and similar parameter settings. As an R&D system, it allows meaningful analysis of errors and relatively straightforward means of improvement
  • Keywords
    error analysis; hidden Markov models; iterative methods; learning (artificial intelligence); speech recognition; 7-digit strings; HMM-based recognizer; acoustic phonetic models; error analysis; improvement; iterative training; labeled utterances; microsegment-based connected digit recognition; orthographic supervision; phonetic models; pronunciation; recognition system; unlabeled data; word error rate; Context modeling; Decoding; Engines; Error analysis; Hidden Markov models; Instruments; Loudspeakers; Speech recognition; System testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.598864
  • Filename
    598864