• DocumentCode
    2287327
  • Title

    Speaker normalization by input space optimization for continuous density hidden Markov models

  • Author

    Wu, Jianxiong ; Qi, Zeyu ; Chan, Chorkin ; Li, Jiegu

  • Author_Institution
    Inst. of Image Process. & Pattern Recognition, Shanghai Jiaotong Univ., China
  • fYear
    1994
  • fDate
    13-16 Apr 1994
  • Firstpage
    682
  • Abstract
    This paper proposes a novel method of speaker normalization by means of input space optimization for continuous density hidden Markov models (CDHMM). The parameters of a linear feature transformation function are so determined that, together with the previously trained CDHMM parameters, a mis-classification cost function is minimized for the normalizing data set. Preliminary experimental results on the task of sex adaptation for speaker-independent stop consonant discrimination, evaluated from the DARPA TIMIT speech database, demonstrates the effectiveness of the proposed method
  • Keywords
    acoustic signal processing; hidden Markov models; optimisation; speech analysis and processing; speech recognition; CDHMM; DARPA TIMIT speech database; acoustic variability; continuous density hidden Markov models; experimental results; input space optimization; linear feature transformation function; mis-classification cost function; normalizing data set; sex adaptation; speaker normalization; speaker-independent stop consonant discrimination; speech recognition; Automatic speech recognition; Cost function; Hidden Markov models; Image processing; Loudspeakers; Neural networks; Optimization methods; Spatial databases; Speech analysis; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
  • Print_ISBN
    0-7803-1865-X
  • Type

    conf

  • DOI
    10.1109/SIPNN.1994.344837
  • Filename
    344837