• DocumentCode
    1732311
  • Title

    Compressed sensing based scalable speech coders

  • Author

    Daniels, M.L. ; Rao, Bhaskar

  • Author_Institution
    Dept. of Music, Univ. of California, San Diego, La Jolla, CA, USA
  • fYear
    2012
  • Firstpage
    92
  • Lastpage
    96
  • Abstract
    Code-excited linear prediction (CELP) is one of the most commonly used approaches for speech compression. The CELP coder, in particular the ACELP coder, relies on extracting an appropriate excitation sequence that is passed through a long-term and short-term linear prediction filter to synthesize the speech. One limitation of the analysis-by-synthesis search methods employed in the ACELP coder is that the positions of the non-zero entries in the excitation sequence and their gains are limited. Increasing the richness of the excitation to improve the speech quality is not only accompanied by the usual increase in the bit rate but also by a significant increase in search complexity. We propose dealing with this scalability issue by using tools from the compressed sensing (CS) domain. An analysis by synthesis coder using a CS based excitation sequence framework is developed and the encoder is shown to scale gracefully with CS dimension and offers a mechanism for quality to bit rate trade off.
  • Keywords
    data compression; filters; speech coding; speech synthesis; vocoders; ACELP coder; code-excited linear prediction; compressed sensing domain; excitation sequence framework; linear prediction filter; scalable speech coders; search complexity; speech compression; speech quality; speech synthesis; synthesis coder;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA
  • ISSN
    1058-6393
  • Print_ISBN
    978-1-4673-5050-1
  • Type

    conf

  • DOI
    10.1109/ACSSC.2012.6488965
  • Filename
    6488965