  • DocumentCode
    3360889
  • Title
    The two-dimensional discrete cosine transform applied to speech data
  • Author
    Baghai-Ravary, L. ; Beet, S.W. ; Tokhi, M.O.
  • Author_Institution
    Dept. of Electronic & Electrical Engineering, University of Sheffield, UK
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    244
  • Abstract
    A two-dimensional discrete cosine transform (2-D DCT), often used for image coding, has been applied to sequences of speech spectra produced by the maximum likelihood method (MLM). The coded data was compressed by nearly 90%, reducing it to a size smaller than that needed to store the coefficients of a 10th-order linear predictive coding (LPC) model. The DCT-encoded data was then reconstructed and tested for intelligibility. Speech reconstructed with the two-dimensional DCT method was found to be significantly more intelligible and more natural-sounding than speech reconstructed with the LPC technique.
  • Keywords
    discrete cosine transforms; maximum likelihood estimation; speech coding; speech intelligibility; transform coding; 2D DCT; DCT encoded data reconstruction; LPC; data compression; linear predictive coding; maximum likelihood method; speech data; speech spectra sequences; two-dimensional discrete cosine transform; Discrete transforms; Filters; Image coding; Image reconstruction; Spectrogram; Technological innovation; Two dimensional displays
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    1996 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-96), Conference Proceedings
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.1996.540403
  • Filename
    540403
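
The abstract describes applying a 2-D DCT to a sequence of speech spectra, discarding most of the coefficients, and reconstructing the spectra from what remains. The record does not say which coefficients the authors kept or how they were quantized, so the following is only a minimal sketch under stated assumptions: the input is a small block of spectra (rows are time frames, columns are frequency bins), the transform is an orthonormal DCT-II, and compression is modelled by keeping the largest-magnitude 10% of coefficients. All function names here are hypothetical and are not taken from the paper.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n list of rows."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(m):
    return [list(r) for r in zip(*m)]


def dct2(block):
    """2-D DCT of a frames-by-bins block: C_time * X * C_freq^T."""
    c_time = dct_matrix(len(block))
    c_freq = dct_matrix(len(block[0]))
    return matmul(matmul(c_time, block), transpose(c_freq))


def idct2(coeffs):
    """Inverse of dct2; the bases are orthonormal, so the inverse is the transpose."""
    c_time = dct_matrix(len(coeffs))
    c_freq = dct_matrix(len(coeffs[0]))
    return matmul(matmul(transpose(c_time), coeffs), c_freq)


def keep_largest(coeffs, fraction):
    """Zero all but (roughly) the given fraction of largest-magnitude coefficients.

    Assumption for illustration only: the paper's actual selection rule is
    not given in this record. Ties at the threshold are all kept.
    """
    flat = sorted((abs(v) for row in coeffs for v in row), reverse=True)
    n_keep = max(1, int(round(fraction * len(flat))))
    threshold = flat[n_keep - 1]
    return [[v if abs(v) >= threshold else 0.0 for v in row] for row in coeffs]


if __name__ == "__main__":
    # Synthetic, smoothly varying "spectra" standing in for MLM speech spectra.
    spectra = [[math.sin(0.3 * t) + math.cos(0.2 * f) for f in range(8)]
               for t in range(6)]
    compressed = keep_largest(dct2(spectra), 0.10)
    approx = idct2(compressed)
    err = max(abs(a - b) for ra, rb in zip(approx, spectra)
              for a, b in zip(ra, rb))
    print("max reconstruction error:", err)
```

Because the transform is orthonormal, the energy in the discarded coefficients equals the squared reconstruction error, which is why smooth spectral sequences tolerate heavy truncation. A practical coder would also quantize the retained coefficients, a step this sketch omits.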