Title :
A phonetically switched ADPCM speech coder
Author :
Ramadas, Pravin ; Gibson, Jerry D.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Santa Barbara, CA
Abstract :
Voice activity detection and phonetically based segmentation are used to classify input speech into four modes: onset, silence, unvoiced, and voiced. Each phonetic segment is coded at a bit rate suited to its mode using a G.726 ADPCM encoder, with separate encoder state information preserved for each mode. The proposed speech coder achieves a PESQ-MOS equivalent to that of G.726 ADPCM at 24 kbps, but at an average rate below 16 kbps, when encoding a typical telephone conversation. The scheme incurs a moderate encoder delay of 40 ms.
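The mode switching described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the mode-to-rate table, the class name `ModeSwitchedCoder`, and the contents of the per-mode state are hypothetical, and no actual G.726 encoding is performed. The only standard facts used are that G.726 operates on 8 kHz speech and offers 2, 3, 4, or 5 bits per sample (16, 24, 32, or 40 kbps). The sketch shows two ideas: each mode resumes from its own saved state, and the average rate is the sample-weighted mean of the per-mode rates.

```python
from dataclasses import dataclass, field

SAMPLE_RATE_HZ = 8000  # G.726 operates on 8 kHz sampled speech

# Hypothetical mode-to-rate assignment (assumption, not taken from the paper).
# Silence is assumed to cost no bits; the other values are G.726 bits/sample.
BITS_PER_SAMPLE = {"silence": 0, "unvoiced": 2, "voiced": 3, "onset": 5}


def _fresh_states():
    # Placeholder state per mode; a real coder would keep the ADPCM
    # predictor and quantizer adaptation variables here.
    return {mode: {"samples_seen": 0} for mode in BITS_PER_SAMPLE}


@dataclass
class ModeSwitchedCoder:
    states: dict = field(default_factory=_fresh_states)
    total_bits: int = 0
    total_samples: int = 0

    def encode_segment(self, mode: str, num_samples: int) -> int:
        """Account for one phonetic segment and return the bits it costs."""
        state = self.states[mode]             # resume this mode's own state
        state["samples_seen"] += num_samples  # stand-in for state adaptation
        bits = BITS_PER_SAMPLE[mode] * num_samples
        self.total_bits += bits
        self.total_samples += num_samples
        return bits

    def average_kbps(self) -> float:
        """Sample-weighted average bit rate over everything encoded so far."""
        if self.total_samples == 0:
            return 0.0
        return self.total_bits / self.total_samples * SAMPLE_RATE_HZ / 1000


# Hypothetical one-second segmentation: (mode, number of samples).
segments = [("silence", 4000), ("onset", 400), ("voiced", 2800), ("unvoiced", 800)]
coder = ModeSwitchedCoder()
for mode, count in segments:
    coder.encode_segment(mode, count)
print(f"average rate: {coder.average_kbps():.1f} kbps")
```

With this made-up segmentation, half of the samples are silence, so the average rate falls well below the rate used for voiced speech. The actual figures depend on the classifier and on the rates the authors assign to each mode, which the abstract does not list.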
Keywords :
adaptive modulation; differential pulse code modulation; speech coding; G.726 ADPCM encoder; PESQ-MOS; bit rate 24 kbit/s; phonetic segment; phonetically based segmentation; phonetically switched ADPCM speech coder; speech classification; voice activity detection; speech
Conference_Title :
Signals, Systems and Computers, 2008 42nd Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
978-1-4244-2940-0
Electronic_ISBN :
1058-6393
DOI :
10.1109/ACSSC.2008.5074818