Title :
Wideband Speech Coding Advances in VMR-WB Standard
Author :
Jelínek, Milan ; Salami, Redwan
Author_Institution :
Dept. of Electr. & Comput. Eng., Sherbrooke Univ., Que.
fDate :
5/1/2007 12:00:00 AM
Abstract :
This paper presents novel techniques for source-controlled variable-rate wideband speech coding. These techniques have been used in the variable-rate multimode wideband (VMR-WB) speech codec recently selected by the Third-Generation Partnership Project 2 (3GPP2) for wideband (WB) speech telephony, streaming, and multimedia messaging services in the cdma2000 third-generation wireless system. The codec utilizes efficient coding modes optimized for different classes of speech signal including generic coding based on AMR-WB for transients and onsets, voiced coding optimized for stable voiced signals, unvoiced coding optimized for unvoiced segments, and comfort noise generation for inactive segments. Several innovations enable very good performance at average bit rates below 8 kb/s for active speech coding. The article presents an overview of the codec and describes in detail some of the codec novel features: Robust pitch tracking algorithm, coding-mode dependent prediction of linear prediction (LP) filter quantization, and novel frame erasure concealment techniques including supplementary information for reconstruction of lost onsets and improving decoder convergence. Selected results from the Selection and Characterization tests of the codec illustrate its performance
Keywords :
3G mobile communication; broadband networks; electronic messaging; filtering theory; linear predictive coding; media streaming; speech coding; voice communication; 3GPP2; Third-Generation Partnership Project 2; VMR-WB standard; cmda2000; coding-mode dependent prediction; frame erasure concealment techniques; generic coding; linear prediction filter quantization; multimedia messaging services; robust pitch tracking algorithm; source-controlled variable-rate coding; speech signal; stable voiced signals; streaming; unvoiced coding; variable-rate multimode wideband speech codec; wideband speech coding; wideband speech telephony; Message service; Multimedia systems; Noise generators; Signal generators; Speech codecs; Speech coding; Speech enhancement; Streaming media; Telephony; Wideband; Linear predictive coding; standardization; variable-rate speech coding; wideband speech coding;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2007.894514