• DocumentCode
    2017158
  • Title

    Improving GMM-based spectral conversion with optimal conversion function selection

  • Author

    Hwang, Hsin-Te ; Wu, Wen-Liang ; Chen, Sin-Horng

  • Author_Institution
    Dept. of Electr. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    392
  • Lastpage
    396
  • Abstract
    We address the problem in the conventional Gaussian mixture model (GMM)-based spectral conversion from the viewpoint of optimal conversion function selection. The proposed method is motivated by that if the optimal conversion function based on minimum mel-cepstral distortion (MMCD) criterion can be selected during the conversion stage, the conversion performance in terms of mel-cepstral distortion (MCD) can be improved dramatically. To this end, our goal is to improve the accuracy rate of the optimal conversion function selection by the MMCD-based data clustering with Linear Discriminant Analysis (LDA). Experiment results confirmed that the proposed method can effectively improve the conventional method.
  • Keywords
    Gaussian processes; pattern clustering; speech processing; speech synthesis; statistical analysis; GMM-based spectral conversion; Gaussian mixture model; data clustering; linear discriminant analysis; minimum mel-cepstral distortion; optimal conversion function selection; voice conversion; Accuracy; Joints; Linear discriminant analysis; Speech; Training; Transforms; Vectors; Gaussian mixture model (GMM); Voice conversion (VC);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684860
  • Filename
    5684860