• DocumentCode
    3521028
  • Title

    Real-time recognition of subword units on a hybrid multi-DSP/ASIC based acoustic front-end

  • Author

    Aktas, Abdulmesih ; Hoge, Harald

  • Author_Institution
    Siemens AG, Munchen, West Germany
  • fYear
    1989
  • fDate
    23-26 May 1989
  • Firstpage
    101
  • Abstract
    A description is given of the hardware and software structure of the acoustic-phonetic decoding done in real time within the speaker-adaptive continuous speech understanding system SPICOS (Siemens, Philips, IPO continuous speech recognition and understanding). SPICOS is designed as a German language man-machine dialogue interface system consisting of acoustic-phonetic decoding, linguistic analysis, dialogue-modeling, and speech-synthesis modules. The acoustic-phonetic decoding is based on an articulatory feature vector, which is used to recognize subword units with hidden Markov models (HMM). Feature extraction and recognition are supported by special hardware. For the formant extraction, 16 LPC reflection coefficients are calculated by a signal processor and mapped onto a codebook with 4000 codes containing formant hypotheses. The latter task is performed by a dedicated application-specific integrated circuit designed for vector quantization
  • Keywords
    application specific integrated circuits; digital signal processing chips; speech recognition; German language man-machine dialogue interface system; LPC reflection coefficients; SPICOS; acoustic-phonetic decoding; application-specific integrated circuit; articulatory feature vector; codebook; dialogue-modeling; feature extraction; formant extraction; hidden Markov models; hybrid multi-DSP/ASIC based acoustic front-end; linguistic analysis; real time system; speaker-adaptive continuous speech understanding system; speech recognition; speech-synthesis modules; subword units; vector quantization; Application specific integrated circuits; Decoding; Feature extraction; Hardware; Hidden Markov models; Man machine systems; Natural languages; Real time systems; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
  • Conference_Location
    Glasgow
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1989.266373
  • Filename
    266373