Title :
ATREUS: a comparative study of continuous speech recognition systems at ATR
Author :
Nagai, A. ; Yamaguchi, K. ; Sagayama, S. ; Kurematsu, A.
Author_Institution :
ATR Interpreting Telephony Res. Lab., Soraku-gun, Kyoto, Japan
Abstract :
The authors describe ATREUS, an aggregation of a wide variety of continuous speech recognition systems that forms the spoken-input front end of an interpreting telephony system. ATREUS includes the following phone models: discrete HMMs (hidden Markov models) with fuzzy vector quantization (VQ) and multiple codebooks; continuous mixture density HMMs; hidden Markov networks derived from the SSS (successive state splitting) algorithm; time-delay neural networks; and fuzzy partition models. Its speaker modes are speaker-dependent, speaker-independent, and speaker-adaptive, the last using techniques such as codebook mapping for VQ-HMMs, vector field smoothing for all types of HMMs, and neural network speaker mapping. The systems are compared in terms of structure, constituent techniques, hardware implementation, and performance. ATREUS was evaluated on Japanese phrase speech recognition. A combination called ATREUS/SSS-LR performed best among the ATREUS systems.
Keywords :
fuzzy logic; hidden Markov models; neural nets; speech recognition; telephony; vector quantisation; ATREUS; Japanese; codebook mapping; continuous speech recognition systems; fuzzy partition models; fuzzy vector quantization; hardware implementation; interpreting telephony system; multiple codebooks; neural network speaker mapping; performance; successive state splitting; time-delay neural networks; vector field smoothing
Conference_Title :
1993 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-93)
Conference_Location :
Minneapolis, MN, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.1993.319251