DocumentCode :
2132384
Title :
A vector-quantizer based method of speaker normalization
Author :
Shin, Ok Keun
Author_Institution :
Sch. of Inf. Technol., Korea Maritime Univ., Busan, South Korea
fYear :
2005
fDate :
2005
Firstpage :
402
Lastpage :
407
Abstract :
As an effort to reduce the performance decline of speaker independent speech recognizers due to inter-speaker variations of vocal tract length among population, a method of speaker normalization based on vector quantization is proposed. In this paper, presented is an iterative method of constructing the ´normalized´ codebook that can be used as a text independent warp factor estimator for LVCSR system. Given the normalized codebook, the warp factor is estimated by searching the best fitting warped version of feature vectors of a given utterance. Throughout the whole process of normalized codebook construction and warp factor estimation, neither acoustic, nor phonetic knowledge is made use of The effectiveness of the proposed method is investigated by performing recognition experiments. The results showed more than 4% improvements in word level accuracy.
Keywords :
audio coding; feature extraction; iterative methods; speech recognition; vector quantisation; best fitting warped feature vectors; interspeaker variations; iterative method; normalized codebook; performance declination; phonetic knowledge; speaker independent speech recognizer; speaker normalization; speech recognition; text independent warp factor estimator; utterance; vector quantization; vocal tract length; warp factor estimation; Feature extraction; Frequency; Hidden Markov models; Information technology; Iterative methods; Loudspeakers; Maximum likelihood estimation; Signal processing; Speech recognition; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Science, 2005. Fourth Annual ACIS International Conference on
Print_ISBN :
0-7695-2296-3
Type :
conf
DOI :
10.1109/ICIS.2005.21
Filename :
1515437
Link To Document :
بازگشت