• DocumentCode
    2488137
  • Title
    Enhanced 2.4 kb/s mixed excitation linear prediction vocoder
  • Author
    Du, Song ; Cui, Huijuan

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    1998
  • fDate
    22-24 Oct 1998
  • Abstract
    We describe improvements to a voice codec that was issued on June 12, 1997 as a draft US Federal Information Processing Standard: analog to digital conversion of voice by 2400 bit/second mixed excitation linear prediction (MELP). For pitch estimation, MELP applies a pitch doubling check algorithm and a strong voice pitch smoothing algorithm. However, these algorithms are too simple to compute an accurate and smooth pitch period: the pitch sometimes jumps, especially during voice transitions, and about 5% to 10% of the pitch estimates are still incorrect. To obtain a more accurate and smooth pitch period, a dynamic frame relative smoothing algorithm is applied to optimize the pitch period in MELP. After pitch smoothing, almost all of the errors are eliminated. To suit Chinese speech, we retrain the prediction parameters codebook for MELP using the simulated annealing algorithm on a Chinese voice database. An Itakura distance test of distortion shows that the codebook obtained by the simulated annealing algorithm has less distortion than the codebook obtained by the traditional LBG algorithm. For an Itakura distance between -0.1 and 0.1, the probability of distortion of the former is 15% greater than that of the latter. The enhanced algorithm gives a better codebook and more fluent Chinese synthetic speech.
  • Keywords
    linear predictive coding; parameter estimation; simulated annealing; smoothing methods; speech codecs; speech coding; vector quantisation; vocoders; 2.4 kbit/s; Chinese; Itakura distance test; LBG algorithm; MELP; PLC; US Federal Information Processing Standards; VQ codebook; analog to digital conversion; distortion probability; dynamic frame relative smoothing algorithm; mixed excitation linear prediction vocoder; pitch doubling check algorithm; pitch estimates; pitch estimation; pitch period; prediction parameters codebook; simulated annealing algorithm; synthetic speech; voice codec; voice database; voice pitch smoothing algorithm; voice transition; Autocorrelation; Degradation; Dynamic programming; Filters; Linear predictive coding; Predictive models; Simulated annealing; Smoothing methods; Speech enhancement; Vocoders;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Communication Technology Proceedings, 1998. ICCT '98. 1998 International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    7-80090-827-5
  • Type
    conf
  • DOI
    10.1109/ICCT.1998.741016
  • Filename
    741016