DocumentCode :
431644
Title :
Jointly optimal time segmentation, component selection and quantization for sinusoidal coding of audio and speech
Author :
Heusdens, Richard ; Jensen, Jesper
Author_Institution :
Dept. of Mediamatics, Delft Univ. of Technol., Netherlands
Volume :
3
fYear :
2005
fDate :
18-23 March 2005
Abstract :
We propose a rate-distortion optimal algorithm for sinusoidal modeling of audio and speech. The algorithm determines, for a pre-specified target bit-rate, the optimal (variable-length) time segmentation, the optimal distribution of sinusoidal components over the segments, and the optimal (scalar) quantizers for quantizing the sinusoid parameters. The segment lengths, number of sinusoids, and quantizers are optimized jointly using high-resolution quantization theory and dynamic programming techniques, which makes it possible to solve the optimization problem in polynomial time. A particular advantage of the proposed method is that, given a target bit-rate, it solves the problem of finding the optimal balance between the total number of sinusoids and the number of bits per sinusoid.
Keywords :
audio coding; dynamic programming; quantisation (signal); rate distortion theory; speech coding; dynamic programming; high-resolution quantization theory; optimal quantization; optimal time segmentation; optimization; polynomial time; rate-distortion optimal algorithm; sinusoidal audio coding; sinusoidal component selection; sinusoidal speech coding; variable-length time segmentation; Auditory system; Dynamic programming; Entropy; Frequency; Humans; Polynomials; Quantization; Rate distortion theory; Signal analysis; Speech coding;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8874-7
Type :
conf
DOI :
10.1109/ICASSP.2005.1415679
Filename :
1415679