DocumentCode :
960062
Title :
Determining Number of Speakers From Multispeaker Speech Signals Using Excitation Source Information
Author :
Swamy, R. Kumara ; Murty, K. Sri Rama ; Yegnanarayana, B.
Author_Institution :
Indian Inst. of Technol.-Madras, Chennai
Volume :
14
Issue :
7
fYear :
2007
fDate :
7/1/2007 12:00:00 AM
Firstpage :
481
Lastpage :
484
Abstract :
In this letter, we address the issue of determining the number of speakers from multispeaker speech signals collected simultaneously using a pair of spatially separated microphones. The spatial separation of the microphones results in time delay of arrival of speech signals from a given speaker. The differences in the time delays for different speakers are exploited to determine the number of speakers from the multispeaker signals. The key idea is that for a given speaker, the relative spacings of the instants of significant excitation of the vocal tract system remain unchanged in the direct components of the speech signals at the two microphones. The time delays can be estimated from the cross-correlation of the Hilbert envelopes of the linear prediction residuals of the multispeaker signals collected at the two microphones.
Keywords :
correlation methods; microphones; speech processing; Hilbert envelopes; cross correlation; excitation source information; linear prediction residuals; multispeaker speech signals; spatially separated microphones; time delay of arrival; vocal tract system; Additive noise; Covariance matrix; Delay effects; Eigenvalues and eigenfunctions; Microphones; Noise robustness; Sensor arrays; Signal processing; Speech; Testing; Excitation source; Hilbert envelope; linear prediction residual; multispeaker signals; time-delay estimation; underdetermined case;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2006.891333
Filename :
4244497
Link To Document :
بازگشت