Title :
Using long-term information to improve robustness in Speaker Identification
Author :
Lyons, James G. ; O´Connell, James G. ; Paliwal, Kuldip K.
Author_Institution :
Signal Process. Lab., Griffith Univ., Brisbane, QLD, Australia
Abstract :
In this paper we propose two new methods of improving the robustness of Automatic Speaker Identification systems. These methods rely on using long-term information in the speech signal to improve the robustness of the features. The first method involves averaging filterbank parameters from consecutive short-time frames over a longer window. The second method investigates the use of frame lengths longer than generally assumed stationary. We show that these two methods result in an improvement over standard Mel Frequency Cepstral Coefficients in the presence of additive white Gaussian noise in speaker identification applications. Furthermore, additional improvements are observed at mid-range SNR when the proposed methods are used in combination.
Keywords :
AWGN; channel bank filters; speaker recognition; Mel frequency cepstral coefficients; SNR; additive white Gaussian noise; automatic speaker identification systems; averaging filterbank parameters; speech signal; Acceleration; Accuracy; Mel frequency cepstral coefficient; Robustness; Signal to noise ratio; Speech; Speech recognition; Feature averaging; analysis window duration; automatic speaker identification; long window; speaker recognition;
Conference_Titel :
Signal Processing and Communication Systems (ICSPCS), 2010 4th International Conference on
Conference_Location :
Gold Coast, QLD
Print_ISBN :
978-1-4244-7908-5
Electronic_ISBN :
978-1-4244-7906-1
DOI :
10.1109/ICSPCS.2010.5709772