• DocumentCode
    1323599
  • Title

    Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion

  • Author

    Wu, Zhizheng ; Kinnunen, Tomi ; Chng, Eng Siong ; Li, Haizhou

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    19
  • Issue
    12
  • fYear
    2012
  • Firstpage
    914
  • Lastpage
    917
  • Abstract
    A robust voice conversion function relies on a large amount of parallel training data, which is difficult to collect in practice. To tackle the sparse parallel training data problem in voice conversion, this paper describes a mixture of factor analyzers method which integrates prior knowledge from non-parallel speech into the training of conversion function. The experiments on CMU ARCTIC corpus show that the proposed method improves the quality and similarity of converted speech. With both objective and subjective evaluations, we show the proposed method outperforms the baseline GMM method.
  • Keywords
    speech processing; CMU ARCTIC corpus; baseline GMM method; factor analyzers; nonparallel speech; objective evaluations; robust voice conversion function; sparse parallel training data problem; subjective evaluations; Covariance matrix; Expectation-maximization algorithms; Speech; Training data; Vectors; Voice conversion; factor analysis; mixture of factor analyzers; prior knowledge;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2012.2225615
  • Filename
    6334423