• DocumentCode
    1694618
  • Title
    Speech coding based on pitch synchrony and two-stage transformation
  • Author
    Xiao-ming Li ; Chang-chun Bao ; W. Bastiaan Kleijn
  • Author_Institution
    Speech & Audio Signal Process. Lab., Beijing Univ. of Technol., Beijing, China
  • fYear
    2013
  • Firstpage
    8159
  • Lastpage
    8163
  • Abstract
    This paper proposes an effective speech coder based on a sparse representation of speech that exploits the strong dependencies between adjacent pitch cycles. The coder uses pitch-synchronous processing, consisting of pitch warping and a two-stage transformation, to achieve a compact representation of voiced speech. Power spectral density preserving quantization (PSD-PQ) is used to quantize the transform coefficients. The resulting coder is efficient over a wide range of bit rates: it approaches perfect reconstruction as the rate increases and uses a parametric signal representation at low rates. Both objective PESQ results and subjective A/B listening tests show that the proposed coder outperforms the ITU-T G.722.1 codec.
  • Keywords
    codecs; signal representation; speech coding; ITU-T G.722.1 codec; parametric signal representation; pitch cycles; pitch synchrony; pitch warping; pitch-synchronous processing; power spectral density preserving quantization; sparse representation; speech coder; transform coefficients; two-stage transformation; voiced speech; Bit rate; Modulation; Quantization (signal); Speech; Speech processing; Transforms; compact representation; pitch-synchronous; quantization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2013.6639255
  • Filename
    6639255