• DocumentCode
    3344075
  • Title

    Mixture Gaussian envelope chirp model for speech and audio

  • Author

    Mondal, Bishwarup ; Sreenivas, T.V.

  • Author_Institution
    Dept. of Electr. Commun. Eng., Indian Inst. of Sci., Bangalore, India
  • Volume
    2
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    857
  • Abstract
    We develop a parametric sinusoidal analysis/synthesis model which can be applied to both speech and audio signals. These signals are characterised by large amplitude variations and small frequency variation within a short analysis frame. The model comprises of a Gaussian mixture representation for the envelope and a sum of linear chirps for the frequency components. A closed form solution is derived for the frequency domain parameters of a chirp with Gaussian-mixture envelope, based on the spectral moments. An iterative algorithm is developed to select and estimate prominent chirps based on the psycho-acoustic masking threshold. The model can adaptively select the number of time-domain and frequency-domain parameters to suit a particular type of signal. Experimental evaluation of the technique has shown that about 2 to 4 parameters/ms is sufficient for near transparent quality reconstruction of a variety of wide-band music and speech signals
  • Keywords
    Gaussian processes; audio signal processing; frequency-domain analysis; iterative methods; parameter estimation; signal reconstruction; signal representation; spectral analysis; speech processing; time-domain analysis; Gaussian mixture representation; Gaussian-mixture envelope; audio signals; closed form solution; frequency components; frequency domain parameters; frequency-domain parameters; iterative algorithm; large amplitude variations; linear chirps; mixture Gaussian envelope chirp model; parametric sinusoidal analysis/synthesis model; psycho-acoustic masking threshold; quality reconstruction; small frequency variation; spectral moments; speech signals; time-domain parameters; wide-band music signals; Chirp; Closed-form solution; Frequency domain analysis; Gaussian processes; Iterative algorithms; Psychoacoustic models; Signal analysis; Signal synthesis; Speech analysis; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.941050
  • Filename
    941050