• DocumentCode
    3481919
  • Title

    Speech conversion using MELP speech coding algorithm

  • Author

    Salor, Özgül ; Demirekler, Mübeccel

  • Author_Institution
    Orta Dogu Teknik Universitesi, Ankara, Turkey
  • fYear
    2004
  • fDate
    28-30 April 2004
  • Firstpage
    268
  • Lastpage
    271
  • Abstract
    In this work, the MELP (mixed excitation linear prediction) speech coding algorithm has been used for speech conversion. Speech conversion aims to modify the speech of one speaker such that the modified speech sounds as if spoken by another speaker. Speech modeling of MELP has been used to derive a mapping the between the speech models of the two speakers. We have obtained a mapping which provides a context-free speech conversion. We have mainly considered the spectral properties of the speakers. Using the 230 sentences of the two speakers, a mapping between the 4-stage vector quantization indexes for line spectral frequencies (LSF) of the two speakers have been obtained. Two different methods have been proposed to obtain a codebook for the second speaker from this mapping and both have been applied in addition to pitch modification during synthesis. The first method replaces the LSF index of the first speaker with that of the second speaker, which appears the most, during training. The second method uses the weighted average from the histogram of the second speaker that corresponds to the index of the first speaker, to form a new LSF codebook for the second speaker. Subjective ABX listening tests have been carried out and the correct speaker perception rate has been obtained as 70% and 65% for the first and the second spectral conversion methods respectively.
  • Keywords
    linear predictive coding; spectral analysis; speech coding; statistical analysis; table lookup; vector quantisation; LSF; MELP; codebook; context-free speech conversion; line spectral frequencies; mixed excitation linear prediction; pitch modification; spectral properties; speech coding algorithm; speech conversion; vector quantization indexes; weighted histogram average; Histograms; Loudspeakers; Prediction algorithms; Speech coding; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference, 2004. Proceedings of the IEEE 12th
  • Print_ISBN
    0-7803-8318-4
  • Type

    conf

  • DOI
    10.1109/SIU.2004.1338311
  • Filename
    1338311