  • DocumentCode
    2019969
  • Title
    ATREUS: a comparative study of continuous speech recognition systems at ATR
  • Author
    Nagai, A. ; Yamaguchi, K. ; Sagayama, S. ; Kurematsu, A.
  • Author_Institution
    ATR Interpreting Telephony Res. Lab., Soraku-gun, Kyoto, Japan
  • Volume
    2
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    139
  • Abstract
    The authors describe ATREUS, a collection of a wide variety of continuous speech recognition systems that forms the spoken-input front end of an interpreting telephony system. ATREUS includes the following phone models: discrete HMMs (hidden Markov models) with fuzzy vector quantization (VQ) and multiple codebooks; continuous mixture density HMMs; hidden Markov networks derived from the SSS (successive state splitting) algorithm; time-delay neural networks; and fuzzy partition models. Its speaker modes are speaker-dependent, speaker-independent, and speaker-adaptive; the adaptation techniques include codebook mapping for VQ-HMMs, vector field smoothing for all types of HMMs, and neural network speaker mapping. A comparative study is given from the viewpoints of structure, constituent techniques, hardware implementation, and performance. ATREUS was evaluated on Japanese phrase speech recognition. A combination called ATREUS/SSS-LR had the best performance among the ATREUS systems.
  • Keywords
    fuzzy logic; hidden Markov models; neural nets; speech recognition; telephony; vector quantisation; ATREUS; Japanese; codebook mapping; continuous speech recognition systems; fuzzy partition models; fuzzy vector quantization; hardware implementation; interpreting telephony system; multiple codebooks; neural network speaker mapping; performance; successive state splitting; time-delay neural networks; vector field smoothing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type
    conf
  • DOI
    10.1109/ICASSP.1993.319251
  • Filename
    319251