Title :
Frame energy estimation based on speech codec parameters
Author :
Kim, Doh-Suk ; Cao, Binshi ; Tarraf, Ahmed
Author_Institution :
Alcatel-Lucent, Whippany, NJ
fDate :
March 31 2008-April 4 2008
Abstract :
This paper proposes an efficient method for estimating frame energy of speech from enhanced variable rate coder (EVRC) bitstream for network-based speech processing applications in transcoder free operation (TrFO) environments, where speech signals are represented as speech coding parameters. A frame of speech energy is decomposed into the energy of excitation and vocal tract filter, and the frame energy estimation method is derived for each component. Among many parameters of EVRC bitstream, the fixed codebook gain and adaptive codebook gain are used for the estimation of excitation energy, and line spectrum pair (LSP) information is used to estimate the energy of vocal tract filter. Experimental results demonstrated the novelty of the proposed method. The correlation coefficient between the actual and estimated frame energy can be maintained at a value of 0.994 with just 5% multiplicative operations of full decoding.
Keywords :
decoding; filtering theory; signal representation; spectral analysis; speech codecs; speech coding; transcoding; variable rate codes; vocoders; EVRC bitstream; decoding; enhanced variable rate coder; frame energy estimation; line spectrum pair information; network-based speech processing; signal representation; speech codec; speech coding parameters; transcoder free operation environment; vocal tract filter; Decoding; Filters; Linear predictive coding; Signal processing; Signal synthesis; Speech codecs; Speech coding; Speech enhancement; Speech processing; Speech synthesis; CELP; EVRC; TrFO; codec parameters; frame energy estimation;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517941