• DocumentCode
    591768
  • Title

    Effective sentence selection based on phone/model coverage maximization for speaker adaptation in HMM-based speech synthesis

  • Author

    Cheng Hsien Lin ; Po Kai Huang ; Cheng Yuan Lin ; Chih Chung Kuo

  • Author_Institution
    ITRI, Hsinchu, Taiwan
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    74
  • Lastpage
    78
  • Abstract
    Reducing the recording effort required in practical speaker adaptive text-to-speech applications would be very useful. In this paper, we present two sentence selection approaches based on a greedy algorithm; one is based on phone coverage and the other is based on model coverage. The former considers the phonetic information in speaker adaptation data, while the latter focuses on occurrences of Mel-cepstral and logF0 models in decision trees of the average voice model. To verify the efficacy of the proposed methods, we compare their performance with that of a random selection method in objective and subjective evaluations. The objective and subjective evaluation results demonstrate that both methods outperform the random selection method.
  • Keywords
    hidden Markov models; speech synthesis; HMM-based speech synthesis; Mel-cepstral models; logF0 models; model coverage; phone coverage; phone-model coverage maximization; random selection method; sentence selection; speaker adaptation data; speaker adaptive text-to-speech applications; Adaptation models; Data models; Greedy algorithms; Hidden Markov models; Speech; Speech synthesis; Training; HMM-based speech synthesis; greedy algorithm; model coverage; speaker adaptation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423469
  • Filename
    6423469