DocumentCode
2488137
Title
Enhanced 2.4 kb/s mixed excitation linear prediction vocoder
Author
Du, Song ; Cui, Huijuan
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear
1998
fDate
22-24 Oct 1998
Abstract
We describe the improvements of a voice codec that has been issued as a draft for the US Federal Information Processing Standards-analog to digital conversion of voice by 2400 bit/second mixed excitation linear prediction (MELP) on June 12, 1997. In pitch estimation in MELP, a pitch doubling check algorithm and a strong voice pitch smoothing algorithm are applied. However, these algorithms are too simple to compute an accurate and smooth pitch period, and a leap of the pitch happens sometimes, especially during voice transition, about 5% to 10% of the pitch estimates are still not correct. In order to obtain a more accurate and smooth pitch period, a dynamic frame relative smoothing algorithm is applied to optimize the pitch period in MELP. After pitch smoothing almost all the errors are eliminated. In order to fit Chinese, we retrain the prediction parameters codebook for MELP using the simulated annealing algorithm based on a Chinese voice database. The Itakura distance test of distortion is applied, which shows the codebook obtained by the simulated annealing algorithm has less distortion than the codebook obtained by the traditional LBG algorithm. The probability of distortion of the former is 15% greater than the latter, for an Itakura distance between -0.1 and 0.1. The enhanced algorithm gives a better codebook for more fluent Chinese synthetic speech
Keywords
linear predictive coding; parameter estimation; simulated annealing; smoothing methods; speech codecs; speech coding; vector quantisation; vocoders; 2.4 kbit/s; Chinese; Itakura distance test; LBG algorithm; MELP; PLC; US Federal Information Processing Standards; VQ codebook; analog to digital conversion; distortion probability; dynamic frame relative smoothing algorithm; mixed excitation linear prediction vocoder; pitch doubling check algorithm; pitch estimates; pitch estimation; pitch period; prediction parameters codebook; simulated annealing algorithm; synthetic speech; voice codec; voice database; voice pitch smoothing algorithm; voice transition; Autocorrelation; Degradation; Dynamic programming; Filters; Linear predictive coding; Predictive models; Simulated annealing; Smoothing methods; Speech enhancement; Vocoders;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication Technology Proceedings, 1998. ICCT '98. 1998 International Conference on
Conference_Location
Beijing
Print_ISBN
7-80090-827-5
Type
conf
DOI
10.1109/ICCT.1998.741016
Filename
741016
Link To Document