DocumentCode :
1125722
Title :
Determining Mixing Parameters From Multispeaker Data Using Speech-Specific Information
Author :
Yegnanarayana, B. ; Swamy, R. Kumara ; Murty, K. Sri Rama
Author_Institution :
Int. Inst. of Inf. Technol., Hyderabad, India
Volume :
17
Issue :
6
Year :
2009
Firstpage :
1196
Lastpage :
1207
Abstract :
In this paper, we propose an approach for processing multispeaker speech signals collected simultaneously using a pair of spatially separated microphones in a real room environment. Spatial separation of the microphones results in a fixed time-delay of arrival of speech signals from a given speaker at the pair of microphones. These time-delays are estimated by exploiting the impulse-like characteristic of excitation during speech production. The differences in the time-delays for different speakers are used to determine the number of speakers from the mixed multispeaker speech signals. There is a difference in the signal levels due to differences in the distances between the speaker and each of the microphones. The differences in the signal levels dictate the values of the mixing parameters. Knowledge of speech production, especially the excitation source characteristics, is used to derive an approximate weight function for locating the regions specific to a given speaker. The scatter plots of the weighted and delay-compensated mixed speech signals are used to estimate the mixing parameters. The proposed method is applied to data collected in an actual laboratory environment for an underdetermined case, where the number of speakers is more than the number of microphones. Enhancement of speech due to a speaker is also examined using the information of the time-delays and the mixing parameters, and is evaluated using objective measures proposed in the literature.
Keywords :
microphones; speech enhancement; mixing parameters; multispeaker data; multispeaker speech signal processing; speech production; speech specific information; Acoustic sensors; Acoustical engineering; Delay estimation; Finite impulse response filter; Nonlinear filters; Signal generators; Signal processing; Source separation; Speech processing; Excitation source; speaker localization; time-delay estimation
Language :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2009.2016230
Filename :
5153554