Title :
On the effect of SNR and superdirective beamforming in speaker diarisation in meetings
Author :
Zwyssig, Erich ; Renals, Steve ; Lincoln, Mike
Author_Institution :
Centre for Speech Technology Research, University of Edinburgh, Edinburgh, UK
Abstract :
This paper examines the effect of sensor performance on speaker diarisation in meetings and investigates the use of more advanced beamforming techniques, beyond the typically employed delay-sum beamformer, for mitigating the effects of poorer sensor performance. We present superdirective beamforming and investigate how different time difference of arrival (TDOA) smoothing and beamforming techniques influence the performance of state-of-the-art diarisation systems. We produced and transcribed a new corpus of meetings recorded in the instrumented meeting room using a high-SNR analogue microphone array and a newly developed low-SNR digital MEMS microphone array (DMMA.2). This research demonstrates that TDOA smoothing has a significant effect on the diarisation error rate and that simple noise reduction and beamforming schemes suffice to overcome audio signal degradation due to the lower SNR of modern MEMS microphones.
Keywords :
array signal processing; audio signal processing; micromechanical devices; microphone arrays; speaker recognition; time-of-arrival estimation; DMMA.2; SNR; TDOA; audio signal degradation; delay-sum beamformer; digital MEMS microphone array; meetings; speaker diarisation; superdirective beamforming; time difference of arrival; Arrays; Delay; Microphones; Signal to noise ratio; Speech; Speaker diarisation in meetings
Conference_Title :
2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288839