Title :
A comparative study of robust linear predictive analysis methods with applications to speaker identification
Author :
Ramachandran, Ravi P. ; Zilovic, Mihailo S. ; Mammone, Richard J.
Author_Institution :
CAIP Center, Rutgers Univ., Piscataway, NJ, USA
fDate :
3/1/1995 12:00:00 AM
Abstract :
Various linear predictive (LP) analysis methods are studied and compared from the points of view of robustness to noise and of application to speaker identification. The key to the success of the LP techniques is in separating the vocal tract information from the pitch information present in a speech signal even under noisy conditions. In addition to considering the conventional, one-shot weighted least-squares methods, the authors propose three other approaches with the above point as a motivation. The first is an iterative approach that leads to the weighted least absolute value solution. The second is an extension of the one-shot least-squares approach and achieves an iterative update of the weights. The update is a function of the residual and is based on minimizing a Mahalanobis distance. Third, the weighted total least-squares formulation is considered. A study of the deviations in the LP parameters is done when noise (white Gaussian and impulsive) is added to the speech. It is revealed that the most robust method depends on the type of noise. Closed-set speaker identification experiments with 20 speakers are conducted using a vector quantizer classifier trained on clean speech. The relative performance of the various LP approaches depends on the type of speech material used for testing
Keywords :
Gaussian noise; iterative methods; least squares approximations; linear predictive coding; speaker recognition; speech processing; vector quantisation; white noise; Mahalanobis distance; impulsive; iterative approach; one-shot weighted least-squares methods; pitch information; robust linear predictive analysis methods; speaker identification; speech signal; vector quantizer classifier; vocal tract information; weighted least absolute value solution; weighted total least-squares formulation; white Gaussian; Conducting materials; Filters; Gaussian noise; Iterative methods; Noise robustness; Speaker recognition; Speech analysis; Speech enhancement; Vectors; White noise;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on