• DocumentCode
    3593327
  • Title
    Speech compression with preservation of speaker identity
  • Author
    Leis, John ; Phythian, Mark ; Sridharan, Sridha
  • Author_Institution
    Fac. of Eng., Univ. of Southern Queensland, Toowoomba, Qld., Australia
  • Volume
    3
  • fYear
    1997
  • Firstpage
    1711
  • Abstract
    Although much effort has been directed recently towards speech compression at rates below 4 kb/s, the primary metric for comparison has, understandably, been the amount of spectral distortion in the decompressed speech. However, an aspect which is becoming important in some applications is the ability to identify the original speaker from the coded speech algorithmically. We investigate here the effect of speech compression using multistage vector quantization of the short-term (formant) filter parameters on text-independent speaker identification. It is demonstrated that in cases where the speech is stored in a compressed database for retrieval, the speaker model should be constructed from the raw speech before spectral compression. Additionally, Gaussian models of sufficiently high order are able to reduce the negative effects of spectral vector quantization upon speaker identification accuracy.
  • Keywords
    Gaussian processes; data compression; filtering theory; signal representation; speaker recognition; spectral analysis; speech coding; vector quantisation; Gaussian models; VQ; coded speech; compressed database; decompressed speech; formant filter parameters; multistage vector quantization; raw speech; short term filter parameters; speaker identification accuracy; speaker identity preservation; speaker model; spectral compression; spectral distortion; spectral vector quantization; spectrum representation; speech compression; text independent speaker identification; Australia; Databases; Filters; Laboratories; Phase change materials; Predictive models; Signal processing; Speech coding; Speech processing; Vector quantization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type
    conf
  • DOI
    10.1109/ICASSP.1997.598850
  • Filename
    598850