مرکز منطقه ای اطلاع رساني علوم و فناوري - Analysis-by-synthesis frame dropping algorithm together with a novel speech recognizer using time-varying hidden Markov model

DocumentCode :

173842

Title :

Analysis-by-synthesis frame dropping algorithm together with a novel speech recognizer using time-varying hidden Markov model

Author :

Lee-Min Lee ; Fu-Rong Jean

Author_Institution :

Dept. of Electr. Eng., Dayeh Univ., Changhua, Taiwan

fYear :

2014

fDate :

5-8 Oct. 2014

Firstpage :

2293

Lastpage :

2298

Abstract :

In distributed speech recognition applications, variable frame rate (VFR) analysis is a technique that can reduce the channel bandwidth and computation resources. In this method, slowly changing frames that provide little information are abandoned. Rapidly changing frames, on the other hand, that are more related to speech perception are preserved. In this paper, we proposed an analysis-by-synthesis (AbS) frame dropping algorithm together with a novel VFR decoding method for hidden Markov modeling of speech. A recursive formula for the calculation of forward probability function of the VFR observations was derived and was used to form a time-varying hidden Markov model (tvHMM) with transition probabilities that are depended on the time difference between successive observations. A generalized Viterbi decoding algorithm was developed to decode the VFR observations. We also use an example to explain the decoding process for a particular VFR observation sequence. Experiments were conducted to investigate the effectiveness of the proposed AbS-tvHMM method. The experimental results show that our method can achieve essentially the same accuracy as full frame rate observations at frame rate of only 40 % and significantly reduces the computation time.

Keywords :

Viterbi decoding; hidden Markov models; probability; recursive estimation; speech coding; speech recognition; time-varying systems; variable rate codes; AbS frame dropping algorithm; AbS-tvHMM method; VFR analysis; VFR decoding method; Viterbi decoding algorithm; analysis-by-synthesis frame dropping algorithm; channel bandwidth reduction; distributed speech recognition applications; forward probability function; hidden Markov speech modeling; recursive formula; speech perception; speech recognizer; time-varying hidden Markov model; transition probabilities; variable frame rate analysis; Accuracy; Algorithm design and analysis; Decoding; Hidden Markov models; Speech; Speech recognition; Viterbi algorithm; Speech recognition; Viterbi algorithm; distributed speech recognition; hidden Markov model (HMM); variable frame rate (VFR);

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on

Conference_Location :

San Diego, CA

Type :

conf

DOI :

10.1109/SMC.2014.6974268

Filename :

6974268

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=173842