Title :
Mixed wideband speech and music coding using a speech/music discriminator
Author_Institution :
Telecommun. & Ind. Phys., CSIRO, Epping, NSW
Abstract :
In multimedia applications such as videoconferencing, users are demanding higher quality speech/audio transmission than the POTS can offer. 7kHz wideband speech/audio offers a good compromise between bandwidth and sound quality. It improves the intelligibility and naturalness of speech and adds a feeling of transparent communication. Currently the only existing international standard for coding such signals is the G.722 wideband speech/audio coder. While its coding quality is satisfactory it leaves much to be desired with its bit rate. The CELP-based approach has been very successful in telephone bandwidth speech coding, but is not suitable for coding non-speech signals because of the assumed signal production model. This paper proposes an alternative approach to mixed speech/music coding, which uses a discriminator to separate music signals from speech, and codes them with the G.722 coder and a G.723.1-based speech coder, respectively. Simulations shows very promising results
Keywords :
audio coding; music; speech coding; telecommunication standards; 7 kHz; G.722 coder; G.723.1-based speech coder; assumed signal production model; audio transmission; coding quality; intelligibility; international standard; mixed wideband speech music coding; multimedia applications; naturalness; nonspeech signal; quality; speech/audio transmission; speech/music discriminator; transparent communication; videoconferencing; Bandwidth; Bit rate; Communication industry; Frequency; Image coding; Multiple signal classification; Music; Speech coding; Telephony; Wideband;
Conference_Titel :
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-7803-4365-4
DOI :
10.1109/TENCON.1997.648278