• DocumentCode
    1761805
  • Title
    Polyglot Speech Synthesis Based on Cross-Lingual Frame Selection Using Auditory and Articulatory Features
  • Author
    Chia-Ping Chen ; Yi-Chin Huang ; Chung-Hsien Wu ; Kuan-De Lee
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
  • Volume
    22
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    1558
  • Lastpage
    1570
  • Abstract
    In this paper, an approach for polyglot speech synthesis based on cross-lingual frame selection is proposed. This method requires only mono-lingual speech data of different speakers in different languages for building a polyglot synthesis system, thus reducing the burden of data collection. Essentially, a set of artificial utterances in the second language for a target speaker is constructed based on the proposed cross-lingual frame-selection process, and this data set is used to adapt a synthesis model in the second language to the speaker. In the cross-lingual frame-selection process, we propose to use auditory and articulatory features to improve the quality of the synthesized polyglot speech. For evaluation, a Mandarin-English polyglot system is implemented where the target speaker only speaks Mandarin. The results show that decent performance regarding voice identity and speech quality can be achieved with the proposed method.
  • Keywords
    natural language processing; speech synthesis; Mandarin-English polyglot system; articulatory features; auditory features; cross-lingual frame-selection process; cross-lingual frame selection; polyglot speech synthesis; Adaptation models; Feature extraction; Hidden Markov models; IEEE transactions; Speech
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    2329-9290
  • Type
    jour
  • DOI
    10.1109/TASLP.2014.2339738
  • Filename
    6857339