• DocumentCode
    691943
  • Title

    A New Model-Based Prosody Coder for Mandarin Speech

  • Author

    Chen-Yu Chiang ; Yu-Ping Hung ; Sin-Horng Chen ; Yih-Ru Wang

  • Author_Institution
    Dept. of Commun. Eng., Nat. Taipei Univ., Taipei, Taiwan
  • fYear
    2013
  • fDate
    16-18 Oct. 2013
  • Firstpage
    60
  • Lastpage
    63
  • Abstract
    In this paper, a novel parametric prosody coding approach for Mandarin speech is proposed. It employs a hierarchical prosodic model (HPM) as a prosody generating model in the encoder to analyze the speech prosody of the input utterance to obtain a parametric representation of four prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and syllable-juncture pause duration for encoding. In the decoder, the four prosodic-acoustic features are reconstructed by a synthesis operation using the decoded HPM parameters. The reconstructed prosodic features are lastly used in an HMM-based speech synthesizer to help to generate the reconstructed speech. Experimental results show that the reconstructed speech has good quality at low data rates of 114.9 bits/s for a speaker-dependent task. An informal listening test confirmed decoded speeches sounded very fluently.
  • Keywords
    acoustic signal processing; signal reconstruction; speech coding; HMM-based speech synthesizer; HPM parameter decoding; Mandarin speech prosody analysis; data rates; hierarchical prosodic model; informal listening test; input utterance; model-based prosody coder; parametric prosody coding approach; parametric representation; prosodic-acoustic feature reconstruction; prosody generating model; speaker-dependent task; speech decoding; speech reconstruction; syllable duration; syllable energy level; syllable pitch contour; syllable-juncture pause duration; synthesis operation; Decoding; Energy states; Hidden Markov models; Pragmatics; Speech; Speech coding; Prosodic model; Prosody coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Hiding and Multimedia Signal Processing, 2013 Ninth International Conference on
  • Conference_Location
    Beijing
  • Type

    conf

  • DOI
    10.1109/IIH-MSP.2013.24
  • Filename
    6846580