Title :
Spectral mask estimation using deep neural networks for inter-sensor data ratio model based robust DOA estimation
Author :
Zheng, W.Q. ; Zou, Y.X. ; Ritz, C.
Author_Institution :
Sch. of ECE, Peking Univ., Shenzhen, China
Abstract :
Accurate DOA estimation based on clustering the inter-sensor data ratios (ISDRs) of a single acoustic vector sensor (AVS), referred as AVS-ISDR, relies on reliable extraction of time-frequency points with high local signal-to-noise ratio (HLSNR-TFPs) and its performance degrades in noisy environments. This paper investigates deep neural networks (DNNs) trained with noisy-clean speech pairs under different SNR levels and noise types to improve the performance of AVS-ISDR in noise conditions. The DNNs is trained to learn characteristics reflecting the level of speech information at different TFPs, which helps to generate a reliable spectral mask for obtaining a noise-reduced spectral. Correspondingly, a robust DOA estimation algorithm named as AVS-DNN-ISDR has been developed. Experimental results verify the proposed DNN-based spectral mask improves the reliable HLSNR-TFPs extraction at different SNR levels. Results from simulations and real AVS recordings further validate AVS-DNN-ISDR achieving high DOA estimation accuracy even when the SNR is lower than 0dB.
Keywords :
direction-of-arrival estimation; neural nets; speech processing; time-frequency analysis; AVS-DNN-ISDR; acoustic vector sensor; deep neural networks; high local signal-to-noise ratio; inter-sensor data ratio model based robust DOA estimation; noise conditions; noisy environments; noisy-clean speech pairs; spectral mask estimation; speech information; time-frequency points; Acoustics; Direction-of-arrival estimation; Estimation; Noise measurement; Signal to noise ratio; Speech; Direction of arrival estimation; acoustic vector sensor; deep neural networks; inter-sensor data ratios; spectral mask estimation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7177984