• DocumentCode
    455425
  • Title

    Spatial Separation of Speech Signals Using Continuously-Variable Masks Estimated From Comparisons of Zero Crossings

  • Author

    Park, Hyung-Min ; Stern, Richard M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA
  • Volume
    4
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    This paper describes an algorithm that achieves noise robustness in speech recognition by reconstructing the desired signal from a mixture of two signals using continuously-variable masks. In contrast to current methods which use binary masks, this approach estimates the relative contribution of the desired source in a mixture of sources and reconstructs the desired signal in proportion to its estimated contribution to each time-frequency segment. Estimation of the continuously-variable masks is based on the relationship between the relative intensity of each source and the interaural time difference (ITD). Estimation of the ITD is accomplished using zero-crossing-based methods. It is shown that the use of zero-crossing approaches to estimate ITDs and continuously-variable masks provide better speech recognition accuracy than cross-correlation-based approaches to ITD estimation and binary masks
  • Keywords
    source separation; speech processing; speech recognition; time-frequency analysis; continuously-variable masks; cross-correlation-based approaches; interaural time difference; signal reconstruction; spatial separation; speech recognition; speech signals; time-frequency segment; zero-crossing-based methods; Acoustic noise; Automatic speech recognition; Frequency estimation; Humans; Information analysis; Noise robustness; Speech coding; Speech recognition; Time frequency analysis; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1661181
  • Filename
    1661181