DocumentCode :
2207304
Title :
A comparison of sinusoidal model variants for speech and audio representation
Author :
Jensen, Jesper ; Heusdens, Richard
Author_Institution :
Dept. of Mediamatics, Tech. Univ. of Delft, Delft, Netherlands
fYear :
2002
fDate :
3-6 Sept. 2002
Firstpage :
1
Lastpage :
4
Abstract :
Two sinusoidal model variants for speech and audio representation are compared: the traditional constant-amplitude, constant-frequency sinusoidal model, and a generalized model where amplitudes can vary exponentially with time. Two classes of methods for estimation of model parameters are reviewed: matching pursuit (MP) and subspace based schemes. Furthermore, Newton optimized versions of these schemes are included in the study. The influence of model type and parameter estimation scheme on model performance was evaluated in simulation experiments with audio and speech signals. As expected, the exponential model outperforms the traditional sinusoidal model in segments with large signal level variations. For the non-optimized estimation schemes, the subspace method generally performs better than the MP method (an SNR gain of 2-7 dB was observed). Newton optimization improves the modeling performance significantly in all cases, and results in slightly better performance with MP (an SNR gain of 1-2 dB) compared to the subspace method.
Keywords :
Newton method; speech recognition; MP; Newton optimized versions; audio representation; audio signals; constant frequency sinusoidal model; constant-amplitude model; exponential model; generalized model; matching pursuit; sinusoidal model variants; speech representation; speech signals; subspace based schemes; subspace method; Abstracts;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2002 11th European
Conference_Location :
Toulouse
ISSN :
2219-5491
Type :
conf
Filename :
7070771
Link To Document :
بازگشت