مرکز منطقه ای اطلاع رساني علوم و فناوري - A perceptual subspace approach for modeling of speech and audio signals with damped sinusoids

DocumentCode :

950788

Title :

A perceptual subspace approach for modeling of speech and audio signals with damped sinusoids

Author :

Jensen, Jesper ; Heusdens, Richard ; Jensen, Søren Holdt

Author_Institution :

Dept. of Mediamatics, Delft Univ. of Technol., Netherlands

Volume :

Issue :

fYear :

2004

fDate :

3/1/2004 12:00:00 AM

Firstpage :

121

Lastpage :

132

Abstract :

The problem of modeling a signal segment as a sum of exponentially damped sinusoidal components arises in many different application areas, including speech and audio processing. Often, model parameters are estimated using subspace based techniques which arrange the input signal in a structured matrix and exploit the so-called shift-invariance property related to certain vector spaces of the input matrix. A problem with this class of estimation algorithms, when used for speech and audio processing, is that the perceptual importance of the sinusoidal components is not taken into account. In this work we propose a solution to this problem. In particular, we show how to combine well-known subspace based estimation techniques with a recently developed perceptual distortion measure, in order to obtain an algorithm for extracting perceptually relevant model components. In analysis-synthesis experiments with wideband audio signals, objective and subjective evaluations show that the proposed algorithm improves perceived signal quality considerable over traditional subspace based analysis methods.

Keywords :

audio signal processing; matrix algebra; speech processing; audio processing; audio signal modeling; damped sinusoid; input matrix; model parameter estimation; perceptual distortion measure; perceptual subspace; shift invariance property; signal quality; signal segment; speech processing; speech signal modeling; structured matrix; vector spaces; wideband audio signals; Distortion measurement; Matching pursuit algorithms; Parameter estimation; Psychoacoustic models; Signal analysis; Signal processing; Signal synthesis; Speech coding; Speech processing; Speech synthesis;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/TSA.2003.819948

Filename :

1284340

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=950788