Regional Information Center for Science and Technology - Near-field source extraction using speech presence probabilities for ad hoc microphone arrays

DocumentCode :

151250

Title :

Near-field source extraction using speech presence probabilities for ad hoc microphone arrays

Author :

Taseska, Maja ; Markovich-Golan, Shmulik ; Habets, Emanuel A. P. ; Gannot, Sharon

Author_Institution :

Int. Audio Labs. Erlangen, Erlangen, Germany

fYear :

2014

fDate :

8-11 Sept. 2014

Firstpage :

169

Lastpage :

173

Abstract :

Ad hoc wireless acoustic sensor networks (WASNs) hold great potential for improved performance in speech processing applications, thanks to better coverage and higher diversity of the received signals. We consider a multiple-speaker scenario in which each WASN node, an autonomous system with sensing, processing and communication capabilities, is positioned in the near field of one of the speakers. Each node aims to extract its nearest speaker while suppressing the other speakers and noise. The ad hoc network is characterized by an arbitrary number of speakers and nodes and an uncontrolled microphone constellation. In this paper we propose a distributed algorithm that shares information between nodes. The algorithm requires each node to transmit a single audio channel in addition to a soft time-frequency (TF) activity mask for its nearest speaker. The TF activity masks are computed per TF bin as a combination of estimates of a model-based speech presence probability (SPP), the direct-to-reverberant ratio (DRR) and the direction of arrival (DOA). The proposed algorithm, although suboptimal compared to the centralized solution, is superior to the single-node solution.
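
The abstract states that the TF activity mask combines SPP, DRR and DOA estimates per TF bin, but does not say how. The sketch below is one possible illustration, not the authors' method: it assumes the three cues are mapped to scores in [0, 1] and multiplied. The function names, the logistic DRR mapping, the Gaussian DOA weighting and all parameter values are hypothetical.

```python
import math

def bin_activity(spp, drr_db, doa_deg, target_doa_deg,
                 drr_midpoint_db=0.0, drr_slope=0.5, doa_width_deg=20.0):
    """Soft activity score in [0, 1] for one TF bin (illustrative only)."""
    # Clamp the speech presence probability to a valid range.
    spp = min(max(spp, 0.0), 1.0)
    # Assumed logistic mapping: a high DRR (in dB) suggests a near-field source.
    drr_score = 1.0 / (1.0 + math.exp(-drr_slope * (drr_db - drr_midpoint_db)))
    # Angular distance to the assumed direction of the nearest speaker,
    # wrapped to [-180, 180) degrees.
    diff = (doa_deg - target_doa_deg + 180.0) % 360.0 - 180.0
    # Assumed Gaussian weighting of the DOA mismatch.
    doa_score = math.exp(-0.5 * (diff / doa_width_deg) ** 2)
    # Assumed combination rule: product of the three per-bin scores.
    return spp * drr_score * doa_score

def soft_activity_mask(spp, drr_db, doa_deg, target_doa_deg):
    """Apply bin_activity to 2-D lists indexed as [frame][frequency]."""
    return [
        [bin_activity(s, r, d, target_doa_deg)
         for s, r, d in zip(spp_row, drr_row, doa_row)]
        for spp_row, drr_row, doa_row in zip(spp, drr_db, doa_deg)
    ]
```

Under these assumptions, a bin scores close to 1 only when speech is likely present, the direct path dominates and the estimated direction matches the nearest speaker. Any single weak cue pulls the score toward 0.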

Keywords :

ad hoc networks; direction-of-arrival estimation; microphone arrays; noise abatement; probability; reverberation; speaker recognition; time-frequency analysis; wireless sensor networks; DOA; DRR; SPP; TF activity; TF bin; WASN nodes; ad hoc microphone arrays; ad hoc wireless acoustic sensor networks; direct to reverberant ratio; direction of arrival; distributed algorithm; information sharing; microphone constellation; model-based speech presence probability; near-field source extraction; nearest speaker extraction; noise suppression; single audio channel; single-node solution; soft time-frequency activity; speech processing applications; Acoustics; Covariance matrices; Direction-of-arrival estimation; Microphones; Noise; Speech; Vectors;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustic Signal Enhancement (IWAENC), 2014 14th International Workshop on

Conference_Location :

Juan-les-Pins

Type :

conf

DOI :

10.1109/IWAENC.2014.6954000

Filename :

6954000

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=151250