DocumentCode
3521028
Title
Real-time recognition of subword units on a hybrid multi-DSP/ASIC based acoustic front-end
Author
Aktas, Abdulmesih ; Hoge, Harald
Author_Institution
Siemens AG, München, West Germany
fYear
1989
fDate
23-26 May 1989
Firstpage
101
Abstract
A description is given of the hardware and software structure of the acoustic-phonetic decoding done in real time within the speaker-adaptive continuous speech understanding system SPICOS (Siemens, Philips, IPO continuous speech recognition and understanding). SPICOS is designed as a German language man-machine dialogue interface system consisting of acoustic-phonetic decoding, linguistic analysis, dialogue-modeling, and speech-synthesis modules. The acoustic-phonetic decoding is based on an articulatory feature vector, which is used to recognize subword units with hidden Markov models (HMMs). Feature extraction and recognition are supported by special hardware. For the formant extraction, 16 LPC reflection coefficients are calculated by a signal processor and mapped onto a codebook with 4000 codes containing formant hypotheses. The latter task is performed by a dedicated application-specific integrated circuit designed for vector quantization.
Keywords
application specific integrated circuits; digital signal processing chips; speech recognition; German language man-machine dialogue interface system; LPC reflection coefficients; SPICOS; acoustic-phonetic decoding; application-specific integrated circuit; articulatory feature vector; codebook; dialogue-modeling; feature extraction; formant extraction; hidden Markov models; hybrid multi-DSP/ASIC based acoustic front-end; linguistic analysis; real time system; speaker-adaptive continuous speech understanding system; speech-synthesis modules; subword units; vector quantization; Application specific integrated circuits; Decoding; Feature extraction; Hardware; Hidden Markov models; Man machine systems; Natural languages; Real time systems; Speech analysis; Speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location
Glasgow
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1989.266373
Filename
266373