DocumentCode :
1732311
Title :
Compressed sensing based scalable speech coders
Author :
Daniels, M.L. ; Rao, Bhaskar
Author_Institution :
Dept. of Music, Univ. of California, San Diego, La Jolla, CA, USA
fYear :
2012
Firstpage :
92
Lastpage :
96
Abstract :
Code-excited linear prediction (CELP) is one of the most commonly used approaches for speech compression. The CELP coder, in particular the ACELP coder, relies on extracting an appropriate excitation sequence that is passed through a long-term and short-term linear prediction filter to synthesize the speech. One limitation of the analysis-by-synthesis search methods employed in the ACELP coder is that the positions of the non-zero entries in the excitation sequence and their gains are limited. Increasing the richness of the excitation to improve the speech quality is not only accompanied by the usual increase in the bit rate but also by a significant increase in search complexity. We propose dealing with this scalability issue by using tools from the compressed sensing (CS) domain. An analysis by synthesis coder using a CS based excitation sequence framework is developed and the encoder is shown to scale gracefully with CS dimension and offers a mechanism for quality to bit rate trade off.
Keywords :
data compression; filters; speech coding; speech synthesis; vocoders; ACELP coder; code-excited linear prediction; compressed sensing domain; excitation sequence framework; linear prediction filter; scalable speech coders; search complexity; speech compression; speech quality; speech synthesis; synthesis coder;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
ISSN :
1058-6393
Print_ISBN :
978-1-4673-5050-1
Type :
conf
DOI :
10.1109/ACSSC.2012.6488965
Filename :
6488965
Link To Document :
بازگشت