DocumentCode
3422018
Title
Multisensor very lowbit rate speech coding using segment quantization
Author
McCree, Alan ; Brady, Kevin ; Quatieri, Thomas F.
Author_Institution
Lincoln Lab., MIT, Lexington, MA
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
3997
Lastpage
4000
Abstract
We present two approaches to noise robust very low bit rate speech coding using wideband MELP analysis/synthesis. Both methods exploit multiple acoustic and non-acoustic input sensors, using our previously-presented dynamic waveform fusion algorithm to simultaneously perform waveform fusion, noise suppression, and cross-channel noise cancellation. One coder uses a 600 bps scalable phonetic vocoder, with a phonetic speech recognizer followed by joint predictive vector quantization of the error in wideband MELP parameters. The second coder operates at 300 bps with fixed 80 ms segments, using novel variable-rate multistage matrix quantization techniques. Formal test results show that both coders achieve equivalent intelligibility to the 2.4 kbps NATO standard MELPe coder in harsh acoustic noise environments, at much lower bit rates, with only modest quality loss.
Keywords
matrix algebra; sensor fusion; speech coding; speech recognition; vector quantisation; vocoders; acoustic noise environments; cross-channel noise cancellation; dynamic waveform fusion algorithm; joint predictive vector quantization; multisensor very low bit rate speech coding; noise suppression; nonacoustic input sensors; phonetic speech recognizer; phonetic vocoder; segment quantization; variable-rate multistage matrix quantization techniques; waveform fusion; Acoustic noise; Acoustic waves; Bit rate; Noise cancellation; Noise robustness; Quantization; Speech analysis; Speech coding; Speech synthesis; Wideband; MELP; Nonacoustic sensor; phonetic vocoder; vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518530
Filename
4518530
Link To Document