DocumentCode :
2488137
Title :
Enhanced 2.4 kb/s mixed excitation linear prediction vocoder
Author :
Du, Song ; Cui, Huijuan
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
1998
fDate :
22-24 Oct 1998
Abstract :
We describe the improvements of a voice codec that has been issued as a draft for the US Federal Information Processing Standards-analog to digital conversion of voice by 2400 bit/second mixed excitation linear prediction (MELP) on June 12, 1997. In pitch estimation in MELP, a pitch doubling check algorithm and a strong voice pitch smoothing algorithm are applied. However, these algorithms are too simple to compute an accurate and smooth pitch period, and a leap of the pitch happens sometimes, especially during voice transition, about 5% to 10% of the pitch estimates are still not correct. In order to obtain a more accurate and smooth pitch period, a dynamic frame relative smoothing algorithm is applied to optimize the pitch period in MELP. After pitch smoothing almost all the errors are eliminated. In order to fit Chinese, we retrain the prediction parameters codebook for MELP using the simulated annealing algorithm based on a Chinese voice database. The Itakura distance test of distortion is applied, which shows the codebook obtained by the simulated annealing algorithm has less distortion than the codebook obtained by the traditional LBG algorithm. The probability of distortion of the former is 15% greater than the latter, for an Itakura distance between -0.1 and 0.1. The enhanced algorithm gives a better codebook for more fluent Chinese synthetic speech
Keywords :
linear predictive coding; parameter estimation; simulated annealing; smoothing methods; speech codecs; speech coding; vector quantisation; vocoders; 2.4 kbit/s; Chinese; Itakura distance test; LBG algorithm; MELP; PLC; US Federal Information Processing Standards; VQ codebook; analog to digital conversion; distortion probability; dynamic frame relative smoothing algorithm; mixed excitation linear prediction vocoder; pitch doubling check algorithm; pitch estimates; pitch estimation; pitch period; prediction parameters codebook; simulated annealing algorithm; synthetic speech; voice codec; voice database; voice pitch smoothing algorithm; voice transition; Autocorrelation; Degradation; Dynamic programming; Filters; Linear predictive coding; Predictive models; Simulated annealing; Smoothing methods; Speech enhancement; Vocoders;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication Technology Proceedings, 1998. ICCT '98. 1998 International Conference on
Conference_Location :
Beijing
Print_ISBN :
7-80090-827-5
Type :
conf
DOI :
10.1109/ICCT.1998.741016
Filename :
741016
Link To Document :
بازگشت