Sub-band Unvoiced/Voiced parameter extraction and efficient quantization for speech signal

Author

Liang Chen ; Yi-peng Zhang ; Liang Pang

Author_Institution

Postgrad. Team 2 CCE, PLA Univ. of Sci & Tech, Nanjing, China

fYear

2013

fDate

12-13 Oct. 2013

Firstpage

1203

Lastpage

1207

Abstract

In Mixed Excitation Linear Prediction algorithm (MELP), the sub-band Unvoiced/Voiced parameters play an important role in improving the naturalness of synthetic speech. However, the coding efficiency with five bits per frame brings difficulties for very low bit rate speech coding. In this paper, the three consecutive MELP frames are grouped into a super-frame, and the fifteen dim sub-band Unvoiced/Voiced parameters are quantized. Through counting the Unvoiced/Voiced distribution probability and optimizing the codebook designed by the distortion measure, it is implemented that every fifteen dim Unvoiced/Voiced vector is quantized efficiently with three bits for each super-frame. Simulation results show that the intelligibility and naturalness are efficiently maintained for synthesis speech, and the quantization scheme can be widely applied to speech coding algorithm below 600bps.

Keywords

linear predictive coding; speech coding; speech synthesis; MELP; efficient quantization; mixed excitation linear prediction algorithm; speech coding; speech signal; subband unvoiced/voiced parameter extraction; synthetic speech; Algorithm design and analysis; Joints; Speech; Speech coding; Training; Vector quantization; Joint vector quantization; Mixed Excitation Linear Prediction (MELP); Perceptual Evaluation of Speech Quality (PESQ); Subband voiced/unvoiced;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Science and Network Technology (ICCSNT), 2013 3rd International Conference on

Conference_Location

Dalian

Type

conf

DOI

10.1109/ICCSNT.2013.6967318

Filename

6967318