Real-time recognition of subword units on a hybrid multi-DSP/ASIC based acoustic front-end

Author

Aktas, Abdulmesih ; Hoge, Harald

Author_Institution

Siemens AG, Munchen, West Germany

fYear

1989

fDate

23-26 May 1989

Firstpage

101

Abstract

A description is given of the hardware and software structure of the acoustic-phonetic decoding done in real time within the speaker-adaptive continuous speech understanding system SPICOS (Siemens, Philips, IPO continuous speech recognition and understanding). SPICOS is designed as a German language man-machine dialogue interface system consisting of acoustic-phonetic decoding, linguistic analysis, dialogue-modeling, and speech-synthesis modules. The acoustic-phonetic decoding is based on an articulatory feature vector, which is used to recognize subword units with hidden Markov models (HMM). Feature extraction and recognition are supported by special hardware. For the formant extraction, 16 LPC reflection coefficients are calculated by a signal processor and mapped onto a codebook with 4000 codes containing formant hypotheses. The latter task is performed by a dedicated application-specific integrated circuit designed for vector quantization

Keywords

application specific integrated circuits; digital signal processing chips; speech recognition; German language man-machine dialogue interface system; LPC reflection coefficients; SPICOS; acoustic-phonetic decoding; application-specific integrated circuit; articulatory feature vector; codebook; dialogue-modeling; feature extraction; formant extraction; hidden Markov models; hybrid multi-DSP/ASIC based acoustic front-end; linguistic analysis; real time system; speaker-adaptive continuous speech understanding system; speech recognition; speech-synthesis modules; subword units; vector quantization; Application specific integrated circuits; Decoding; Feature extraction; Hardware; Hidden Markov models; Man machine systems; Natural languages; Real time systems; Speech analysis; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on

Conference_Location

Glasgow

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.1989.266373

Filename

266373