Title :
Jointly optimal time segmentation, component selection and quantization for sinusoidal coding of audio and speech
Author :
Heusdens, Richard ; Jensen, Jesper
Author_Institution :
Dept. of Mediamatics, Delft Univ. of Technol., Netherlands
Abstract :
We propose a rate-distortion optimal algorithm for sinusoidal modeling of audio and speech. For a pre-specified target bit-rate, the algorithm determines the optimal (variable-length) time segmentation, the optimal distribution of sinusoidal components over the segments, and the optimal (scalar) quantizers for the sinusoid parameters. Segment lengths, number of sinusoids, and quantizers are optimized jointly using high-resolution quantization theory and dynamic programming, which makes it possible to solve the optimization problem in polynomial time. A particular advantage of the proposed method is that, given a target bit-rate, it finds the optimal balance between the total number of sinusoids and the number of bits per sinusoid.
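The dynamic-programming step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it only shows the generic idea of choosing a variable-length segmentation that minimizes a sum of per-segment Lagrangian costs D + lambda * R in polynomial time. The names `optimal_segmentation`, `make_toy_cost`, `lam`, and `bits_per_segment` are hypothetical, and the toy cost uses a piecewise-constant model with a fixed rate per segment as a stand-in for a real sinusoidal model with optimized quantizers.

```python
from typing import Callable, List, Sequence, Tuple


def optimal_segmentation(
    n_frames: int,
    max_len: int,
    segment_cost: Callable[[int, int], float],
) -> Tuple[float, List[Tuple[int, int]]]:
    """Minimize the summed cost over all segmentations of [0, n_frames).

    segment_cost(start, end) is the cost of coding frames [start, end) as one
    segment. Runs in O(n_frames * max_len) cost evaluations.
    """
    inf = float("inf")
    best = [inf] * (n_frames + 1)  # best[e]: minimum cost of covering [0, e)
    back = [0] * (n_frames + 1)    # back[e]: start of the last segment in that optimum
    best[0] = 0.0
    for end in range(1, n_frames + 1):
        for start in range(max(0, end - max_len), end):
            cost = best[start] + segment_cost(start, end)
            if cost < best[end]:
                best[end] = cost
                back[end] = start

    # Trace the back-pointers to recover the segment boundaries.
    segments: List[Tuple[int, int]] = []
    end = n_frames
    while end > 0:
        start = back[end]
        segments.append((start, end))
        end = start
    segments.reverse()
    return best[n_frames], segments


def make_toy_cost(
    signal: Sequence[float], lam: float, bits_per_segment: float
) -> Callable[[int, int], float]:
    """Hypothetical Lagrangian cost D + lam * R for a piecewise-constant model.

    D is the squared error of approximating the segment by its mean;
    R is assumed to be a fixed number of bits per segment.
    """
    def cost(start: int, end: int) -> float:
        chunk = signal[start:end]
        mean = sum(chunk) / len(chunk)
        distortion = sum((x - mean) ** 2 for x in chunk)
        return distortion + lam * bits_per_segment

    return cost


if __name__ == "__main__":
    toy = make_toy_cost([0, 0, 0, 0, 5, 5, 5, 5], lam=1.0, bits_per_segment=1.0)
    total, segs = optimal_segmentation(n_frames=8, max_len=8, segment_cost=toy)
    print(total, segs)
```

In a rate-constrained setting, the multiplier `lam` would typically be adjusted (for example by bisection) until the total rate meets the target bit-rate; that outer loop is omitted here.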
Keywords :
audio coding; dynamic programming; quantisation (signal); rate distortion theory; speech coding; high-resolution quantization theory; optimal quantization; optimal time segmentation; optimization; polynomial time; rate-distortion optimal algorithm; sinusoidal audio coding; sinusoidal component selection; sinusoidal speech coding; variable-length time segmentation; Auditory system; Dynamic programming; Entropy; Frequency; Humans; Polynomials; Quantization; Rate distortion theory; Signal analysis; Speech coding
Conference_Title :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
Print_ISBN :
0-7803-8874-7
DOI :
10.1109/ICASSP.2005.1415679