Speaker normalization by input space optimization for continuous density hidden Markov models

Author

Wu, Jianxiong ; Qi, Zeyu ; Chan, Chorkin ; Li, Jiegu

Author_Institution

Inst. of Image Process. & Pattern Recognition, Shanghai Jiaotong Univ., China

fYear

1994

fDate

13-16 Apr 1994

Firstpage

682

Abstract

This paper proposes a novel method of speaker normalization by means of input space optimization for continuous density hidden Markov models (CDHMM). The parameters of a linear feature transformation function are so determined that, together with the previously trained CDHMM parameters, a mis-classification cost function is minimized for the normalizing data set. Preliminary experimental results on the task of sex adaptation for speaker-independent stop consonant discrimination, evaluated from the DARPA TIMIT speech database, demonstrates the effectiveness of the proposed method

Keywords

acoustic signal processing; hidden Markov models; optimisation; speech analysis and processing; speech recognition; CDHMM; DARPA TIMIT speech database; acoustic variability; continuous density hidden Markov models; experimental results; input space optimization; linear feature transformation function; mis-classification cost function; normalizing data set; sex adaptation; speaker normalization; speaker-independent stop consonant discrimination; speech recognition; Automatic speech recognition; Cost function; Hidden Markov models; Image processing; Loudspeakers; Neural networks; Optimization methods; Spatial databases; Speech analysis; Speech processing;

fLanguage

English

Publisher

ieee

Conference_Titel

Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on

Print_ISBN

0-7803-1865-X

Type

conf

DOI

10.1109/SIPNN.1994.344837

Filename

344837