DocumentCode :
3125103
Title :
An improved steady segment based decoding algorithm by using response probability for LVCSR
Author :
Zhanlei Yang ; Wenju Liu ; Hao Chao
Author_Institution :
Nat. Lab. of Pattern Recognition (NLPR), Inst. of Autom., Beijing, China
fYear :
2012
fDate :
5-8 Dec. 2012
Firstpage :
306
Lastpage :
310
Abstract :
This paper proposes a novel decoding algorithm by integrating both steady speech segments and observations´ location information into conventional path extension framework. First, speech segments which possess stable spectrum are extracted. Second, a preliminarily improved algorithm is given by modifying traditional inter-HMM extension framework using the detected steady segments. Then, at probability calculation stage, response probability (RP), which represents location information of observations within acoustic feature space, is further incorporated into decoding. Thus, RP directs the decoder to enhance/weaken path candidates that get through the front end steady-segment-based decoding. Experiments conducted on Mandarin speech recognition show that character error rate of proposed algorithm achieves a 4.6% relative reduction when compared with a system in which only steady segment is used, and run time factor achieves a 10.0% relative reduction when compared with a system in which only RP is used.
Keywords :
decoding; error statistics; hidden Markov models; probability; speech processing; speech recognition; LVCSR; Mandarin speech recognition; acoustic feature space; character error rate; improved steady segment based decoding algorithm; interHMM extension framework; location information representation; observation location information; path extension framework; probability calculation stage; response probability; run time factor; speech segmentation; stable spectrum; steady segment detection; steady speech segments; steady-segment-based decoding; Acoustics; Decoding; Hidden Markov models; Pragmatics; Probability; Speech; Speech recognition; Decoding algorithm; path extension; probability fusion; response probability; steady segment;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
Type :
conf
DOI :
10.1109/ISCSLP.2012.6423525
Filename :
6423525
Link To Document :
بازگشت