DocumentCode
694560
Title
Sub-band Unvoiced/Voiced parameter extraction and efficient quantization for speech signal
Author
Liang Chen ; Yi-peng Zhang ; Liang Pang
Author_Institution
Postgrad. Team 2 CCE, PLA Univ. of Sci & Tech, Nanjing, China
fYear
2013
fDate
12-13 Oct. 2013
Firstpage
1203
Lastpage
1207
Abstract
In the Mixed Excitation Linear Prediction (MELP) algorithm, the sub-band Unvoiced/Voiced parameters play an important role in improving the naturalness of synthetic speech. However, coding them with five bits per frame is too costly for very low bit rate speech coding. In this paper, three consecutive MELP frames are grouped into a super-frame, and the resulting fifteen-dimensional sub-band Unvoiced/Voiced vector is quantized as a whole. By counting the distribution probability of the Unvoiced/Voiced patterns and optimizing the codebook under the distortion measure, each fifteen-dimensional Unvoiced/Voiced vector is quantized with only three bits per super-frame. Simulation results show that the intelligibility and naturalness of the synthesized speech are well maintained, and that the quantization scheme can be widely applied to speech coding algorithms below 600 bps.
Keywords
linear predictive coding; speech coding; speech synthesis; MELP; efficient quantization; mixed excitation linear prediction algorithm; speech signal; subband unvoiced/voiced parameter extraction; synthetic speech; Algorithm design and analysis; Joints; Speech; Speech coding; Training; Vector quantization; Joint vector quantization; Mixed Excitation Linear Prediction (MELP); Perceptual Evaluation of Speech Quality (PESQ); Subband voiced/unvoiced;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Network Technology (ICCSNT), 2013 3rd International Conference on
Conference_Location
Dalian
Type
conf
DOI
10.1109/ICCSNT.2013.6967318
Filename
6967318