Title :
Joint time delay and pitch estimation for speaker localization
Author :
Ngan, L.Y. ; Wu, Y. ; So, H.C. ; Ching, P.C. ; Lee, S.W.
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, China
Abstract :
In this paper, we attempt to develop an efficient and accurate algorithm for joint time delay and pitch estimation of a speech signal received at a microphone array. The time delay measurement allows a speaker to be located while the detection of the pitch frequency is useful for analyzing the acoustic properties of the sound. A subspace method based on state-space realization is first introduced for joint time delay and frequency estimation of a synthetic signal consisting of several frequency components. The frequency estimates are obtained directly from the eigenvalues of the state transition matrix whilst the time delay is approximated from the observation matrix and the estimated frequencies using a least square approach. The method is then extended to track both the time delay and pitch frequency of a speech signal, which is modeled by a summation of sinusoids that are harmonically related to the fundamental frequency (pitch) and spectrally shaped by the vocal tract transfer function. Extensive simulation tests have been done to validate the effectiveness and accuracy of the proposed algorithm.
Keywords :
covariance matrices; delay estimation; direction-of-arrival estimation; eigenvalues and eigenfunctions; frequency estimation; least squares approximations; speaker recognition; state-space methods; acoustic properties analysis; efficient accurate algorithm; frequency estimation; least square approach; microphone array; observation matrix; pitch estimation; pitch frequency detection; simulation tests; sinusoid summation; speaker localization; speech signal; state transition matrix eigenvalues; state-space realization; subspace method; synthetic signal; time delay; vocal tract transfer function; Acoustic measurements; Delay effects; Delay estimation; Frequency estimation; Frequency measurement; Least squares approximation; Microphone arrays; Speech; State estimation; Time measurement;
Conference_Titel :
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Print_ISBN :
0-7803-7761-3
DOI :
10.1109/ISCAS.2003.1205121