DocumentCode
310547
Title
A multi-phase approach for fast spotting of large vocabulary Chinese keywords from Mandarin speech using prosodic information
Author
Bai, Bo-Ren ; Tseng, Chiu-Yu ; Lee, Lin-shan
Author_Institution
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume
2
fYear
1997
fDate
21-24 Apr 1997
Firstpage
903
Abstract
This paper presents a multi-phase approach for fast spotting of large vocabulary Chinese keywords from a spontaneous Mandarin speech utterance using prosodic knowledge. Without searching through the whole utterance using large number of keyword models, the multi-phase framework proposed including some special scoring schemes provides very good efficiency by considering the monosyllable-based structure of Mandarin Chinese. This approach is therefore very fast due to very good boundary estimations and the deletion of most impossible syllable and keyword candidates using context independent models, and is also very accurate due to the carefully designed scoring processes. A task with 2611 keywords was tested. An inclusion rate of 85.79% for the top 10 candidates is attained, at a speed requiring only 1.2 times that of the utterance length on a Sparc 20 workstation
Keywords
natural languages; speech recognition; Mandarin Chinese; Sparc 20 workstation; acoustic recognition; boundary estimations; context independent models; efficiency; fast spotting; inclusion rate; keyword models; large vocabulary Chinese keywords; monosyllable based structure; multiphase approach; prosodic information; scoring schemes; spontaneous Mandarin speech utterance; utterance length; Context modeling; Decoding; Hidden Markov models; History; Noise level; Process design; Speech recognition; Testing; Vocabulary; Workstations;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.596082
Filename
596082
Link To Document