DocumentCode :
694560
Title :
Sub-band Unvoiced/Voiced parameter extraction and efficient quantization for speech signal
Author :
Liang Chen ; Yi-peng Zhang ; Liang Pang
Author_Institution :
Postgrad. Team 2 CCE, PLA Univ. of Sci & Tech, Nanjing, China
fYear :
2013
fDate :
12-13 Oct. 2013
Firstpage :
1203
Lastpage :
1207
Abstract :
In Mixed Excitation Linear Prediction algorithm (MELP), the sub-band Unvoiced/Voiced parameters play an important role in improving the naturalness of synthetic speech. However, the coding efficiency with five bits per frame brings difficulties for very low bit rate speech coding. In this paper, the three consecutive MELP frames are grouped into a super-frame, and the fifteen dim sub-band Unvoiced/Voiced parameters are quantized. Through counting the Unvoiced/Voiced distribution probability and optimizing the codebook designed by the distortion measure, it is implemented that every fifteen dim Unvoiced/Voiced vector is quantized efficiently with three bits for each super-frame. Simulation results show that the intelligibility and naturalness are efficiently maintained for synthesis speech, and the quantization scheme can be widely applied to speech coding algorithm below 600bps.
Keywords :
linear predictive coding; speech coding; speech synthesis; MELP; efficient quantization; mixed excitation linear prediction algorithm; speech coding; speech signal; subband unvoiced/voiced parameter extraction; synthetic speech; Algorithm design and analysis; Joints; Speech; Speech coding; Training; Vector quantization; Joint vector quantization; Mixed Excitation Linear Prediction (MELP); Perceptual Evaluation of Speech Quality (PESQ); Subband voiced/unvoiced;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2013 3rd International Conference on
Conference_Location :
Dalian
Type :
conf
DOI :
10.1109/ICCSNT.2013.6967318
Filename :
6967318
Link To Document :
بازگشت