Mixed-phase AR models for voiced speech and perceptual cost functions

Author

Gardner, William R. ; Rao, Bhaskar D.

Author_Institution

Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA, USA

Volume

i

fYear

1994

fDate

19-22 Apr 1994

Abstract

Mixed-phase AR models are introduced for encoding the magnitudes and phases of the harmonics of voiced speech. Motivation for the use of the mixed-phase AR models is given and several cost functions are introduced, forming the basis for algorithms which estimate the model parameters. An efficient algorithm based on a quasi-linear least squares approach is presented, and a more sophisticated algorithm based on the perceptual masking properties of the ear is described. When the algorithms are used to model voiced speech signals using a 14th order mixed-phase model, high quality speech can be produced

Keywords

autoregressive processes; ear; harmonics; least mean squares methods; parameter estimation; speech coding; speech intelligibility; algorithms; ear; harmonics; high quality speech; magnitudes; mixed-phase AR models; model parameters estimation; perceptual cost functions; perceptual masking properties; phases; quasi-linear least squares; speech coding; voiced speech signals; Acoustic pulses; Cost function; Finite impulse response filter; Frequency domain analysis; Integrated circuit modeling; Phase measurement; Power harmonic filters; Pulse shaping methods; Shape; Speech;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location

Adelaide, SA

ISSN

1520-6149

Print_ISBN

0-7803-1775-0

Type

conf

DOI

10.1109/ICASSP.1994.389319

Filename

389319