DocumentCode :
2961937
Title :
High-quality digital speech at 4 kb/s
Author :
Granzow, Wolfgang ; Atal, Bishnu S.
Author_Institution :
AT&T Bell Lab., Murray Hill, NJ, USA
fYear :
1990
fDate :
2-5 Dec 1990
Firstpage :
941
Abstract :
A speech coder based on a single-pulse excitation code-excited linear predictive coding (SPE-CELP) model of linear predictive coding (LPC) is proposed. An algorithm is described for determining the time instants of pitch periods within a short interval of periodic speech; it yields a time sequence of marker points that indicate the beginning of each pitch period in the analyzed speech interval. The LPC excitation is generated by a stochastic codebook for nonperiodic speech and by a single pulse per pitch period for periodic speech. The proper alignment of the excitation pulse is computed efficiently using dynamic programming. It is concluded that, at overall bit rates of around 4 kb/s, the coder produces significantly better speech quality than LPC10E, though the synthesized speech still sounds slightly buzzy for certain speakers.
Keywords :
dynamic programming; encoding; speech analysis and processing; speech synthesis; 4 kbit/s; SPE-CELP model; algorithm; dynamic programming; excitation pulse alignment; high-quality digital speech; nonperiodic speech; periodic speech; pitch periods; single pulse; single-pulse excitation code-excited linear predictive coding; speech coder; stochastic codebook; time instants; Algorithm design and analysis; Bit rate; Dynamic programming; Linear predictive coding; Predictive models; Pulse generation; Speech analysis; Speech coding; Speech synthesis; Stochastic processes;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Global Telecommunications Conference, 1990, and Exhibition. 'Communications: Connecting the Future', GLOBECOM '90., IEEE
Conference_Location :
San Diego, CA
Print_ISBN :
0-87942-632-2
Type :
conf
DOI :
10.1109/GLOCOM.1990.116641
Filename :
116641