• DocumentCode
    3528631
  • Title

    A study on recognizing distorted speech over local distributed transducer networks

  • Author

    Zhao, Yong ; Shin, Sunghwan ; Robledo-Arnuncio, Enrique ; Juang, Biing-Hwang Fred

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4181
  • Lastpage
    4184
  • Abstract
    In a collaborative scenario, a multiplicity of portable devices may constitute a network of distributed microphones, without a clearly defined geometric configuration or synchronization that can be taken advantage of for traditional microphone array processing to enhance the acquired signal. This application scenario represents a severe, but interesting challenge for automatic speech recognition systems. In this paper, we investigate a variety of robust speech recognition techniques with a focus on the distributed transducer scenario. We also report some important study results that lead to new thinking in the design of robust speech recognition for broadened applications. Two issues that are inherent to distributed transducer networks are specially investigated. First, we study the effect of the sampling rate skew of microphones to the system performance; second, we explore the possibility of combining recognition hypotheses from multiple transducer channels for improved recognition accuracy.
  • Keywords
    array signal processing; microphones; speech recognition; automatic speech recognition systems; collaborative scenario; distorted speech recognition; distributed microphones; local distributed transducer networks; microphone array processing; portable devices; Array signal processing; Automatic speech recognition; Collaboration; Microphones; Robustness; Sampling methods; Signal processing; Speech recognition; System performance; Transducers; Robust speech recognition; distributed transducer network; sampling rate skew; system combination;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960550
  • Filename
    4960550