• DocumentCode
    2665105
  • Title

    An acoustic model adaptation using HMM-based speech synthesis

  • Author

    Tanaka, Koji ; Kuroiwa, Shingo ; Tsuge, Satoru ; Ren, Fuji

  • Author_Institution
    Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
  • fYear
    2003
  • fDate
    26-29 Oct. 2003
  • Firstpage
    368
  • Lastpage
    373
  • Abstract
    Recently, personal digital assistants like cellular phones are shifting to the IP terminal. The encoding-decoding process utilized for transmitting over IP networks deteriorates the quality of the speech data. This deterioration causes degradation in speech recognition performance. Acoustic model adaptations could improve recognition performance. However, the current adaptation methods usually require a large amount of adaptation data. A novel adaptation method using speech synthesis based on HMM (hidden Markov model) is proposed. This method does not require speech data for adaptation because speech data is generated by speech synthesis from the acoustic model. Experimental results on G.723.1 coded speech recognition show that the proposed method improves speech recognition performance. A relative improvement in word accuracy of approximately 2% was observed.
  • Keywords
    Internet telephony; hidden Markov models; speech recognition; speech synthesis; G.723.1 coded speech recognition; HMM-based speech synthesis; IP networks; acoustic model adaptation; cellular phone; encoding-decoding process; hidden Markov model; personal digital assistants; speech recognition performance; Acoustic distortion; Adaptation model; Cellular phones; Hidden Markov models; Personal digital assistants; Speech analysis; Speech coding; Speech recognition; Speech synthesis; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-7902-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2003.1275933
  • Filename
    1275933