Title :
N-best vector quantization for isolated word speech recognition
Author :
Nose, Masaya ; Maki, Shuichi ; Yamane, Nobumoto ; Morikawa, Yoshitaka
Author_Institution :
Okayama Univ., Okayama
Abstract :
Speech recognition is performed by utilizing acoustic and linguistic model. The contribution of this paper is improvement of acoustic model. Acoustic model is constructed by hidden Markov model (HMM). HMM has two representations, that are discrete HMM and continuous HMM. The former uses vector quantization (VQ), whereas the latter uses functions such as (mixture) Gaussian distribution. In Viterbi algorithm, VQ has advantage that it only operates by addition. However VQ also has a problem of distortion. This paper attempts to improve recognition precision in discrete HMM with modified VQ that gives multiple outputs for an input.
Keywords :
Gaussian distribution; hidden Markov models; maximum likelihood estimation; speech recognition; vector quantisation; Gaussian distribution; N-best vector quantization; Viterbi algorithm; acoustic model; hidden Markov model; isolated word speech recognition; recognition precision; Cepstrum; Covariance matrix; Data mining; Gaussian distribution; Hidden Markov models; Isolation technology; Mel frequency cepstral coefficient; Nose; Speech recognition; Vector quantization; Baum-Welch algorithm; Isolated word speech recognition; VQ; acoustic model improvement; discrete HMM; speaker-independent speech recognition; vector quantization;
Conference_Titel :
SICE, 2007 Annual Conference
Conference_Location :
Takamatsu
Print_ISBN :
978-4-907764-27-2
Electronic_ISBN :
978-4-907764-27-2
DOI :
10.1109/SICE.2007.4421326