• DocumentCode
    290089
  • Title

    Mixed-phase AR models for voiced speech and perceptual cost functions

  • Author

    Gardner, William R. ; Rao, Bhaskar D.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA, USA
  • Volume
    i
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    Mixed-phase AR models are introduced for encoding the magnitudes and phases of the harmonics of voiced speech. Motivation for the use of the mixed-phase AR models is given and several cost functions are introduced, forming the basis for algorithms which estimate the model parameters. An efficient algorithm based on a quasi-linear least squares approach is presented, and a more sophisticated algorithm based on the perceptual masking properties of the ear is described. When the algorithms are used to model voiced speech signals using a 14th order mixed-phase model, high quality speech can be produced
  • Keywords
    autoregressive processes; ear; harmonics; least mean squares methods; parameter estimation; speech coding; speech intelligibility; algorithms; ear; harmonics; high quality speech; magnitudes; mixed-phase AR models; model parameters estimation; perceptual cost functions; perceptual masking properties; phases; quasi-linear least squares; speech coding; voiced speech signals; Acoustic pulses; Cost function; Finite impulse response filter; Frequency domain analysis; Integrated circuit modeling; Phase measurement; Power harmonic filters; Pulse shaping methods; Shape; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1994.389319
  • Filename
    389319