DocumentCode :
2079627
Title :
Multi-speaker beamforming for voice activity classification
Author :
Tran, T.N. ; Cowley, W. ; Pollok, Andre
Author_Institution :
Inst. for Telecommun. Res., Univ. of South Australia, Adelaide, SA, Australia
fYear :
2013
fDate :
Jan. 29 2013-Feb. 1 2013
Firstpage :
116
Lastpage :
121
Abstract :
In a multi-speaker environment, voice activity classification (VAC) attempts to identify the active speaker(s) in each recording period. Comparing a beamformer-output-ratio (BOR) from a multi-beamforming system against pre-specified thresholds gives an efficient solution for VAC. For the two-speaker case, this paper derives theoretical results on BOR statistics, including the probability distribution function and the cumulative distribution function (c.d.f.) of the BOR, under the assumption that the narrow-band signal power in the frequency domain is Gamma distributed. Using the c.d.f. of the BOR, the VAC thresholds can be calculated automatically via a closed-form expression for given acceptable mis-detection rates. The method is tested with simulated recording setups for a non-reverberant environment and for an environment with a 0.3 second reverberation time. Both simulations show high classification accuracy.
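The thresholding idea in the abstract can be illustrated with a minimal sketch. It is not the paper's closed-form expression: it assumes, hypothetically, that the BOR is the ratio of two independent Gamma-distributed beam output powers, and it estimates a threshold by Monte Carlo sampling and an empirical quantile instead. The function names, the shape and scale parameters, and the three-way decision rule are all illustrative assumptions and do not come from the paper.

```python
import random


def simulate_bor(shape_num, scale_num, shape_den, scale_den, n, rng):
    """Draw n samples of a ratio of two independent Gamma-distributed powers.

    Assumption: the BOR is modelled as (power at beam 1) / (power at beam 2),
    each Gamma distributed with the given shape and scale.
    """
    return [
        rng.gammavariate(shape_num, scale_num) / rng.gammavariate(shape_den, scale_den)
        for _ in range(n)
    ]


def empirical_quantile(samples, p):
    """Return the empirical p-quantile (0 <= p <= 1) of the samples."""
    ordered = sorted(samples)
    index = min(len(ordered) - 1, max(0, int(p * len(ordered))))
    return ordered[index]


def classify(bor, low, high):
    """Hypothetical decision rule: compare one BOR value with two thresholds."""
    if bor > high:
        return "speaker 1"
    if bor < low:
        return "speaker 2"
    return "undecided"


if __name__ == "__main__":
    rng = random.Random(0)
    # Assumed scenario: only speaker 1 is active, so beam 1 carries more power.
    samples = simulate_bor(2.0, 10.0, 2.0, 1.0, 20000, rng)
    # Threshold chosen so that roughly 5% of "speaker 1" frames fall below it,
    # i.e. an acceptable mis-detection rate of 0.05.
    high = empirical_quantile(samples, 0.05)
    print("upper threshold:", high)
```

Under these assumptions the threshold is simply the quantile of the simulated BOR at the chosen mis-detection rate; a closed-form c.d.f., as the abstract describes, would replace the sampling step.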
Keywords :
array signal processing; frequency-domain analysis; gamma distribution; speaker recognition; BOR statistics; CDF; VAC; acceptable misdetection rates; active speaker identification; beamformer-output-ratio; closed form expression; cumulative distribution function; multispeaker beamforming system; narrowband signal power; probability distribution function; time 0.3 s; voice activity classification; Histograms; Noise; Random variables; Reverberation; Speech; Vectors
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Communications Theory Workshop (AusCTW), 2013 Australian
Conference_Location :
Adelaide, SA
Print_ISBN :
978-1-4673-4673-3
Electronic_ISBN :
978-1-4673-4674-0
Type :
conf
DOI :
10.1109/AusCTW.2013.6510055
Filename :
6510055