DocumentCode
463422
Title
Laplace Entropy and its Application to Time Delay Estimation for Speech Signals
Author
Huang, Yiteng Arden ; Benesty, Jacob ; Chen, Jingdong
Author_Institution
Lucent Technol. Bell Labs., Murray Hill, NJ
Volume
1
fYear
2007
fDate
15-20 April 2007
Abstract
Time delay estimation (TDE) is a basic technique for numerous applications that need to localize and track a radiating source. It is particularly challenging in the presence of noise and reverberation, and when the source signal is speech, which is inherently nonstationary and random. The most important TDE algorithms for two sensors are based on the generalized cross-correlation (GCC) method. These algorithms perform reasonably well when reverberation or noise is not too high. In an earlier study by the authors, a more sophisticated approach was proposed. It employs more sensors and takes advantage of their delay redundancy to improve the precision of the time difference of arrival (TDOA) estimate between the first two sensors. The approach is based on the multichannel cross-correlation coefficient (MCCC) and was found to be more robust to noise and reverberation. In this paper, we show that this approach can also be developed on the basis of joint entropy. For Gaussian signals, we show that, in the search for the TDOA estimate, maximizing the MCCC is equivalent to minimizing the joint entropy. When the idea is generalized to non-Gaussian speech signals, however, the new joint-entropy-based multichannel TDE algorithm shows the potential to outperform the MCCC-based method. Since there is no rigorous mathematical formula for speech entropy, we assume that speech can be plausibly modeled by a Laplace distribution and develop a practical approximation of Laplace entropy for TDE of speech signals. The performance of the proposed algorithm is investigated via simulations.
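The Gaussian-case equivalence stated in the abstract can be illustrated numerically. The sketch below is not the authors' implementation. It assumes a far-field source and an equispaced linear array, so that sensor k lags sensor 0 by k times the candidate delay p, and it uses a synthetic white Gaussian source. It relies on two standard expressions: the joint entropy of N jointly Gaussian variables, 0.5 * ln((2*pi*e)^N * det(R)), and the squared MCCC as commonly defined, 1 - det(R) / (product of the channel variances). All function names are hypothetical. The Laplace entropy approximation developed in the paper is not reproduced here, because the abstract does not give its form.

```python
import math
import random


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-15:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def aligned_covariance(signals, p, max_lag):
    """Sample covariance of the channels after advancing channel k by k*p samples."""
    n_ch = len(signals)
    length = len(signals[0])
    margin = (n_ch - 1) * max_lag  # keeps every shifted index inside the signal
    frames = [[signals[k][n + k * p] for n in range(margin, length - margin)]
              for k in range(n_ch)]
    count = len(frames[0])
    means = [sum(f) / count for f in frames]
    return [[sum((frames[i][n] - means[i]) * (frames[j][n] - means[j])
                 for n in range(count)) / count
             for j in range(n_ch)] for i in range(n_ch)]


def gaussian_joint_entropy(cov):
    """Joint entropy (in nats) of a Gaussian vector with covariance matrix cov."""
    n = len(cov)
    return 0.5 * math.log(((2 * math.pi * math.e) ** n) * determinant(cov))


def squared_mccc(cov):
    """Squared MCCC: one minus the determinant of the normalized covariance matrix."""
    variance_product = 1.0
    for i in range(len(cov)):
        variance_product *= cov[i][i]
    return 1.0 - determinant(cov) / variance_product


def estimate_tdoa(signals, max_lag):
    """Return the delay found by minimum entropy and the delay found by maximum MCCC."""
    lags = range(-max_lag, max_lag + 1)
    covs = {p: aligned_covariance(signals, p, max_lag) for p in lags}
    by_entropy = min(lags, key=lambda p: gaussian_joint_entropy(covs[p]))
    by_mccc = max(lags, key=lambda p: squared_mccc(covs[p]))
    return by_entropy, by_mccc


def simulate(n_ch=3, true_delay=3, length=2000, noise_std=0.3, seed=0):
    """Synthetic array data: sensor k receives the source delayed by k*true_delay samples."""
    rng = random.Random(seed)
    pad = (n_ch - 1) * abs(true_delay)
    source = [rng.gauss(0.0, 1.0) for _ in range(length + 2 * pad)]
    return [[source[pad + n - k * true_delay] + rng.gauss(0.0, noise_std)
             for n in range(length)]
            for k in range(n_ch)]


if __name__ == "__main__":
    signals = simulate()
    print(estimate_tdoa(signals, max_lag=5))
```

Because the channel variances barely change with the candidate delay, minimizing det(R) (the entropy criterion) and minimizing det(R) divided by the variance product (the MCCC criterion) select the same delay on this Gaussian data. The two criteria would be expected to differ only once a non-Gaussian entropy model replaces the Gaussian formula.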
Keywords
Gaussian processes; Laplace equations; correlation methods; delays; speech processing; time-of-arrival estimation; Gaussian signals; Laplace entropy approximation; generalized cross-correlation method; joint entropy minimization; mathematical formula; multichannel cross-correlation coefficient; speech signals; time delay estimation; time difference of arrival; Delay effects; Delay estimation; Entropy; Microphones; Noise robustness; Radar tracking; Reverberation; Speech enhancement; Time difference of arrival; Working environment noise; Laplace distribution; Time delay estimation; entropy; multichannel cross-correlation coefficient;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2007.366629
Filename
4217029