DocumentCode
2712182
Title
Solving global permutation ambiguity of time domain BSS using speaker specific features of speech signals
Author
Khanagha, Vahid ; Khanagha, Ali
Author_Institution
Iran Univ. of Sci. & Technol., Tehran, Iran
Volume
2
fYear
2009
fDate
4-6 Oct. 2009
Firstpage
1007
Lastpage
1011
Abstract
Multidimensional localization of multiple sources using BSS based TDOA estimators, requires the solution of global permutation ambiguity before fusing several TDOA estimations. Since the separation quality of BSS isn´t always perfect, it is not easy to decide which TDOA belongs to which source. Here we study the possibility of using several speaker specific features of speech signal in order to recognize perceptually dominant sources in each one of moderately separated outputs of BSS algorithm. We compare the feasibility of different features in terms of validity rate of decisions and computational complexity.
Keywords
computational complexity; direction-of-arrival estimation; speech processing; time-domain analysis; TDOA estimators; computational complexity; global permutation ambiguity; multidimensional localization; speaker specific features; speech signals; time domain BSS; validity rate; Computational complexity; Data mining; Frequency; Industrial electronics; Microphone arrays; Predictive models; Production systems; Sensor arrays; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Industrial Electronics & Applications, 2009. ISIEA 2009. IEEE Symposium on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4244-4681-0
Electronic_ISBN
978-1-4244-4683-4
Type
conf
DOI
10.1109/ISIEA.2009.5356310
Filename
5356310
Link To Document