• DocumentCode
    294622
  • Title

    Spectral excitation coding of speech at 2.4 kb/s

  • Author

    Cuperman, V. ; Lupini, P. ; Bhattacharya, B.

  • Author_Institution
    Sch. of Eng. Sci., Simon Fraser Univ., Burnaby, BC, Canada
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    496
  • Abstract
    We present spectral excitation coding (SEC), a speech codec based on a sinusoidal model applied to the excitation signal. A phase dispersion algorithm allows the same model to be used for voiced as well as unvoiced and transitional sounds. The phase dispersion algorithm significantly improves the perceived quality resulting in more natural reconstructed speech. A new technique for variable dimension vector quantization called nonsquare transform vector quantization (NSTVQ) is used for quantization of the harmonic magnitudes. The SEC system at 2.45 kb/s achieved an MOS score 0.8 points higher than the 2.4 kb/s ZPC-10 standard. A preliminary 1.85 kb/s SEC system which uses zero-bit magnitude quantization is also presented. Informal listening tests indicate that the quality of the 1.85 kb/s system exceeds that of the LPC-10 standard
  • Keywords
    signal reconstruction; spectral analysis; speech codecs; speech coding; speech intelligibility; vector quantisation; 2.4 kbit/s; LPC-10 standard; MOS score; harmonic magnitudes quantization; informal listening tests; natural reconstructed speech; nonsquare transform vector quantization; perceived speech quality; phase dispersion algorithm; sinusoidal model; spectral excitation coding; speech codec; speech coding; transitional sounds; unvoiced sounds; variable dimension vector quantization; zero-bit magnitude quantization; Codecs; Decoding; Encoding; Frequency; Linear predictive coding; Nonlinear filters; Signal synthesis; Speech codecs; Speech coding; Speech synthesis; System testing; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479637
  • Filename
    479637