• DocumentCode
    2712182
  • Title

    Solving global permutation ambiguity of time domain BSS using speaker specific features of speech signals

  • Author

    Khanagha, Vahid ; Khanagha, Ali

  • Author_Institution
    Iran Univ. of Sci. & Technol., Tehran, Iran
  • Volume
    2
  • fYear
    2009
  • fDate
    4-6 Oct. 2009
  • Firstpage
    1007
  • Lastpage
    1011
  • Abstract
    Multidimensional localization of multiple sources using BSS-based TDOA estimators requires solving the global permutation ambiguity before several TDOA estimates are fused. Since the separation quality of BSS isn't always perfect, it is not easy to decide which TDOA belongs to which source. Here we study the possibility of using several speaker-specific features of the speech signal to recognize the perceptually dominant sources in each of the moderately separated outputs of the BSS algorithm. We compare the feasibility of different features in terms of the validity rate of decisions and computational complexity.
  • Keywords
    computational complexity; direction-of-arrival estimation; speech processing; time-domain analysis; TDOA estimators; global permutation ambiguity; multidimensional localization; speaker specific features; speech signals; time domain BSS; validity rate; Data mining; Frequency; Industrial electronics; Microphone arrays; Predictive models; Production systems; Sensor arrays; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Industrial Electronics & Applications, 2009. ISIEA 2009. IEEE Symposium on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4244-4681-0
  • Electronic_ISBN
    978-1-4244-4683-4
  • Type

    conf

  • DOI
    10.1109/ISIEA.2009.5356310
  • Filename
    5356310