• DocumentCode
    2391866
  • Title
    Integrating syllable boundary information into speech recognition
  • Author
    Wu, Su-Lin ; Shire, Michael L. ; Greenberg, Steven ; Morgan, Nelson
  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    987
  • Abstract
    We examine the proposition that knowledge of the timing of syllabic onsets may be useful in improving the performance of speech recognition systems. A method of estimating the location of syllable onsets derived from the analysis of energy trajectories in critical band channels has been developed, and a syllable-based decoder has been designed and implemented that incorporates this onset information into the speech recognition process. For a small, continuous speech recognition task, the addition of artificial syllabic onset information (derived from advance knowledge of the word transcriptions) lowers the word error rate by 38%. Incorporating acoustically-derived syllabic onset information reduces the word error rate by 10% on the same task. The latter experiment has highlighted representational issues in coordinating acoustic and lexical syllabifications, a topic we are beginning to explore.
  • Keywords
    acoustic signal processing; decoding; parameter estimation; speech processing; speech recognition; timing; acoustic syllabification; artificial syllabic onset information; automatic speech recognition systems; continuous speech recognition task; critical band channels; energy trajectories analysis; experiment; lexical syllabification; syllabic onsets timing; syllable based decoder; syllable boundary information; syllable onsets location estimation; system performance; word error rate reduction; word transcriptions; Automatic speech recognition; Computer science; Decoding; Error analysis; Filters; Hidden Markov models; Psychology; Speech analysis; Speech processing; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type
    conf
  • DOI
    10.1109/ICASSP.1997.596105
  • Filename
    596105