DocumentCode :
3528504
Title :
Joint time delay and pitch estimation for speaker localization
Author :
Ngan, L.Y. ; Wu, Y. ; So, H.C. ; Ching, P.C. ; Lee, S.W.
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, China
Volume :
3
fYear :
2003
fDate :
25-28 May 2003
Abstract :
In this paper, we attempt to develop an efficient and accurate algorithm for joint time delay and pitch estimation of a speech signal received at a microphone array. The time delay measurement allows a speaker to be located while the detection of the pitch frequency is useful for analyzing the acoustic properties of the sound. A subspace method based on state-space realization is first introduced for joint time delay and frequency estimation of a synthetic signal consisting of several frequency components. The frequency estimates are obtained directly from the eigenvalues of the state transition matrix whilst the time delay is approximated from the observation matrix and the estimated frequencies using a least square approach. The method is then extended to track both the time delay and pitch frequency of a speech signal, which is modeled by a summation of sinusoids that are harmonically related to the fundamental frequency (pitch) and spectrally shaped by the vocal tract transfer function. Extensive simulation tests have been done to validate the effectiveness and accuracy of the proposed algorithm.
Keywords :
covariance matrices; delay estimation; direction-of-arrival estimation; eigenvalues and eigenfunctions; frequency estimation; least squares approximations; speaker recognition; state-space methods; acoustic properties analysis; efficient accurate algorithm; frequency estimation; least square approach; microphone array; observation matrix; pitch estimation; pitch frequency detection; simulation tests; sinusoid summation; speaker localization; speech signal; state transition matrix eigenvalues; state-space realization; subspace method; synthetic signal; time delay; vocal tract transfer function; Acoustic measurements; Delay effects; Delay estimation; Frequency estimation; Frequency measurement; Least squares approximation; Microphone arrays; Speech; State estimation; Time measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Print_ISBN :
0-7803-7761-3
Type :
conf
DOI :
10.1109/ISCAS.2003.1205121
Filename :
1205121
Link To Document :
بازگشت