DocumentCode :
2642252
Title :
N-best vector quantization for isolated word speech recognition
Author :
Nose, Masaya ; Maki, Shuichi ; Yamane, Nobumoto ; Morikawa, Yoshitaka
Author_Institution :
Okayama Univ., Okayama
fYear :
2007
fDate :
17-20 Sept. 2007
Firstpage :
2058
Lastpage :
2063
Abstract :
Speech recognition is performed by utilizing acoustic and linguistic model. The contribution of this paper is improvement of acoustic model. Acoustic model is constructed by hidden Markov model (HMM). HMM has two representations, that are discrete HMM and continuous HMM. The former uses vector quantization (VQ), whereas the latter uses functions such as (mixture) Gaussian distribution. In Viterbi algorithm, VQ has advantage that it only operates by addition. However VQ also has a problem of distortion. This paper attempts to improve recognition precision in discrete HMM with modified VQ that gives multiple outputs for an input.
Keywords :
Gaussian distribution; hidden Markov models; maximum likelihood estimation; speech recognition; vector quantisation; Gaussian distribution; N-best vector quantization; Viterbi algorithm; acoustic model; hidden Markov model; isolated word speech recognition; recognition precision; Cepstrum; Covariance matrix; Data mining; Gaussian distribution; Hidden Markov models; Isolation technology; Mel frequency cepstral coefficient; Nose; Speech recognition; Vector quantization; Baum-Welch algorithm; Isolated word speech recognition; VQ; acoustic model improvement; discrete HMM; speaker-independent speech recognition; vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SICE, 2007 Annual Conference
Conference_Location :
Takamatsu
Print_ISBN :
978-4-907764-27-2
Electronic_ISBN :
978-4-907764-27-2
Type :
conf
DOI :
10.1109/SICE.2007.4421326
Filename :
4421326
Link To Document :
بازگشت