• DocumentCode
    284602
  • Title

    High performance connected digit recognition using codebook exponents

  • Author

    Cardin, Régis ; Normandin, Yves ; de Mori, Renato

  • Author_Institution
    Centre de Recherche Inf. de Montreal, Que., Canada
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    505
  • Abstract
    The authors describe the latest developments by the speech research group at CRIM in speaker-independent connected digit recognition, using hidden Markov models (HMMs) trained with maximum mutual information estimation (MMIE). The work presented is a continuation of work previously described by the authors (see Proc. 1991 IEEE Inf. Conf. on Acoust. Speech and Sign. Process., pp.533-536). The main differences are: (1) use of the 20-kHz TI/NIST corpus available on CD-ROM (instead of the 10-kHz distribution tape), (2) use of word models (instead of sub-word units), (3) addition of second derivative parameters, and (4) a more elaborate training procedure for codebook exponents. The experiments described were all performed on the complete adult portion of the corpus. The baseline system, using discrete HMMs and MMIE, has a 0.67% word error rate and a 2.03% string error rate. The authors describe techniques that allowed them to improve greatly the recognition rate
  • Keywords
    hidden Markov models; information theory; speech coding; speech recognition; CD-ROM; CRIM; HMM; TI/NIST corpus; codebook exponents; connected digit recognition; hidden Markov models; maximum mutual information estimation; second derivative parameters; speaker independent recognition; speech research; training procedure; word models; Availability; CD-ROMs; Cepstral analysis; Error analysis; Hidden Markov models; Mutual information; NIST; Speech recognition; Technological innovation; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225861
  • Filename
    225861