DocumentCode :
336806
Title :
Log amplitude modeling of sinusoids in voiced speech
Author :
Malik, Najam ; Holmes, W. Harvey
Author_Institution :
Sch. of Electr. & Telecommun. Eng., New South Wales Univ., Sydney, NSW, Australia
Volume :
1
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
465
Abstract :
We present an algorithm for all-pole (envelope) modeling of the amplitudes of sinusoids present in voiced speech segments which works even when the number of sinusoids is very small, as occurs with high-pitched speakers. In contrast to previous methods, this algorithm minimizes a squared error criterion in the log amplitude domain rather than the amplitude domain, and so is better matched to the properties of the human auditory system. A weighted iterative approach is used to get near optimal solutions to this otherwise nonlinear problem. This new frequency domain log amplitude modeling (LAM) algorithm gives impressive results, especially in the case of high pitched female voices where conventional linear prediction methods are inadequate. The algorithm can easily be generalized to develop pole-zero models
Keywords :
frequency-domain analysis; iterative methods; poles and zeros; spectral analysis; speech processing; all-pole modeling; envelope modeling; frequency domain; high pitched female voices; high-pitched speakers; human auditory system; linear prediction methods; log amplitude modeling algorithm; near optimal solutions; nonlinear problem; pole-zero models; sinusoids; squared error criterion; voiced speech; voiced speech segments; weighted iterative approach; Auditory system; Australia; Distortion measurement; Frequency domain analysis; Humans; Iterative algorithms; Iterative methods; Predictive models; Signal processing algorithms; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.758163
Filename :
758163
Link To Document :
بازگشت