• DocumentCode
    310547
  • Title

    A multi-phase approach for fast spotting of large vocabulary Chinese keywords from Mandarin speech using prosodic information

  • Author

    Bai, Bo-Ren ; Tseng, Chiu-Yu ; Lee, Lin-shan

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    903
  • Abstract
    This paper presents a multi-phase approach for fast spotting of large vocabulary Chinese keywords from a spontaneous Mandarin speech utterance using prosodic knowledge. Without searching through the whole utterance using large number of keyword models, the multi-phase framework proposed including some special scoring schemes provides very good efficiency by considering the monosyllable-based structure of Mandarin Chinese. This approach is therefore very fast due to very good boundary estimations and the deletion of most impossible syllable and keyword candidates using context independent models, and is also very accurate due to the carefully designed scoring processes. A task with 2611 keywords was tested. An inclusion rate of 85.79% for the top 10 candidates is attained, at a speed requiring only 1.2 times that of the utterance length on a Sparc 20 workstation
  • Keywords
    natural languages; speech recognition; Mandarin Chinese; Sparc 20 workstation; acoustic recognition; boundary estimations; context independent models; efficiency; fast spotting; inclusion rate; keyword models; large vocabulary Chinese keywords; monosyllable based structure; multiphase approach; prosodic information; scoring schemes; spontaneous Mandarin speech utterance; utterance length; Context modeling; Decoding; Hidden Markov models; History; Noise level; Process design; Speech recognition; Testing; Vocabulary; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596082
  • Filename
    596082