Title :
Structured encoding of the singing voice using prior knowledge of the musical score
Author :
Kim, Youngmoo E.
Author_Institution :
Media Lab., MIT, Cambridge, MA, USA
Abstract :
The human voice is the most difficult musical instrument to simulate convincingly. Yet a great deal of progress has been made in voice coding, the parameterization and re-synthesis of a source signal according to an assumed voice model. Source-filter models of the human voice, particularly linear predictive coding (LPC), are the basis of most low bit rate (speech) coding techniques in use today. This paper introduces a technique for coding the singing voice using LPC and prior knowledge of the musical score to aid in the process of encoding, reducing the amount of data required to represent the voice. This approach advances the singing voice closer towards a structured audio model in which musical parameters such as pitch, duration, and phonemes are represented orthogonally to the synthesis technique and can thus be modified prior to re-synthesis
Keywords :
audio signal processing; data compression; feature extraction; filtering theory; linear predictive coding; music; parameter estimation; speech coding; LPC; duration; human voice; linear predictive coding; low bit rate speech coding; musical instrument; musical parameters representation; musical score; phonemes; pitch; score-based parameter extraction; singing voice; source signal parameterization; source signal re-synthesis; source-filter models; structured audio model; structured encoding; synthesis technique; voice coding; voice model; Data compression; Encoding; Filters; Human voice; Linear predictive coding; Signal analysis; Signal synthesis; Speech analysis; Speech coding; Speech synthesis;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 1999 IEEE Workshop on
Conference_Location :
New Paltz, NY
Print_ISBN :
0-7803-5612-8
DOI :
10.1109/ASPAA.1999.810846