• DocumentCode
    1560969
  • Title

    Synthetic phoneme prototypes in a connected-word speech recognition system

  • Author

    Blomberg, Mats

  • Author_Institution
    Dept. of Speech Commun. & Music Acoust., R. Inst. of Technol., Stockholm, Sweden
  • fYear
    1989
  • Firstpage
    687
  • Abstract
    A recognition system based on a reference library of synthetic phoneme prototypes is described. The phoneme templates are specified in terms of formant synthesis parameters. The vocabulary and grammar are described in a finite-state network where each node represents a phoneme. A transition between two phonemes in the net is expanded to a number of new nodes using interpolation on the synthesis parameters or at the spectrum level. For each node, a 16-channel filter bank section is computed from the synthesis parameters. Adaptation to each speaker's individual voice source spectrum is performed during recognition. Auditory forward masking is incorporated. Speaker-independent recognition results are given for male speakers on isolated words and connected digits. Future improvements include coarticulation and reduction rules and speaker adaptation of phoneme parameters. The method could also be used in combination with hidden Markov models to provide reference data in cases not covered by the training material.
  • Keywords
    speech recognition; speech synthesis; 16-channel filter bank; auditory forward masking; connected digits; connected-word speech recognition system; finite-state network; formant synthesis parameters; grammar; hidden Markov models; interpolation; isolated words; phoneme templates; reduction rules; reference library; speaker adaptation; speaker independent recognition; synthetic phoneme prototypes; training material; vocabulary; voice source spectrum; Hidden Markov models; Libraries; Music; Natural languages; Network synthesis; Production systems; Prototypes; Speech recognition; Speech synthesis; Vocabulary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
  • Conference_Location
    Glasgow
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1989.266520
  • Filename
    266520
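
  • Illustration

    The abstract states that a transition between two phonemes is expanded into new nodes by interpolating the synthesis parameters. The sketch below shows one way such an expansion could look, assuming plain linear interpolation between two parameter sets. The function name expand_transition, the parameter names F1 and F2, and the numeric values are hypothetical and chosen only for illustration; the record does not specify the interpolation scheme, the parameter set, or the number of inserted nodes.

```python
from typing import Dict, List

# A phoneme prototype is modelled here as a mapping from a synthesis
# parameter name to its value (hypothetical representation).
Params = Dict[str, float]


def expand_transition(start: Params, end: Params, n_new: int) -> List[Params]:
    """Return n_new parameter sets evenly spaced strictly between start and end.

    Linear interpolation is an assumption made for this sketch.
    """
    if n_new < 0:
        raise ValueError("n_new must be non-negative")
    if start.keys() != end.keys():
        raise ValueError("both prototypes must define the same parameters")

    nodes: List[Params] = []
    for k in range(1, n_new + 1):
        t = k / (n_new + 1)  # fraction of the way from start to end
        nodes.append(
            {name: (1.0 - t) * start[name] + t * end[name] for name in start}
        )
    return nodes


# Illustrative values only (Hz), not taken from the paper.
vowel_a: Params = {"F1": 700.0, "F2": 1100.0}
vowel_i: Params = {"F1": 300.0, "F2": 2300.0}

transition_nodes = expand_transition(vowel_a, vowel_i, 3)
```

    In a system of the kind described, each interpolated parameter set would then be converted to a filter bank section before matching; that step is not shown here.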