Title :
Excitation codebook design for coding of the singing voice
Author :
Kim, Youngmoo E.
Author_Institution :
Media Lab., MIT, Cambridge, MA, USA
Abstract :
The technique of code excited linear prediction (CELP) has led to the development of voice coding systems that provide toll-quality speech at very low bitrates. While speech and singing share many similarities in terms of production, standard speech coding implementations fall far short when transmitting the singing voice. This paper explores the reasons for this discrepancy and suggests new variations on CELP speech coders that specifically enhance the quality of encoded singing for individual singers. These modifications could be used in a low-bitrate singing voice codec which, in conjunction with multi-track structured coding schemes such as MPEG-4 structured audio, could provide a highly compressed yet high-quality representation of a complex audio scene.
Keywords :
audio coding; data compression; linear predictive coding; music; speech codecs; speech coding; CELP; MPEG-4 structured audio; code excited linear prediction; complex audio scene; excitation codebook design; singing voice codec; singing voice coding; toll quality speech; Bit rate; Codecs; Filters; Resonance; Speech enhancement; Teeth; Tongue; Transmitters
Conference_Title :
2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics
Conference_Location :
New Paltz, NY
Print_ISBN :
0-7803-7126-7
DOI :
10.1109/ASPAA.2001.969566