• DocumentCode
    1721343
  • Title

    Distant Speech Recognition: Bridging the Gaps

  • Author

    McDonough, John ; Wölfel, Matthias

  • Author_Institution
    Spoken Language Syst., Saarland Univ., Saarbrücken
  • fYear
    2008
  • Firstpage
    108
  • Lastpage
    114
  • Abstract
    While great progress has been made in both fields, there is currently a relatively large rift between researchers engaged in acoustic array processing and those engaged in automatic speech recognition. This is unfortunate for many reasons, but most of all because it prevents the two sides, both of whom are investigating different aspects of the same problem, from truly understanding one another and cooperating. In many cases, the two sides see each other through the eyes of strangers. If groundbreaking progress is to be made in the emerging field of distant speech recognition (DSR), this abysmal state of affairs must change. In this work, we outline five pressing problems in the DSR research field, and we make initial proposals for their solutions. The problems discussed here are by no means the only ones that must be solved in order to construct truly effective DSR systems. Nonetheless, their solution, in our view, will represent significant first steps towards this goal, inasmuch as the solution of each of these problems will require a substantial change in the mind-sets and thought patterns of those engaged in this field of research.
  • Keywords
    acoustic arrays; acoustic signal processing; array signal processing; speech recognition; DSR research; acoustic array processing; distant speech recognition; ground breaking progress; Acoustic distortion; Array signal processing; Automatic speech recognition; Eyes; Hidden Markov models; Higher order statistics; Independent component analysis; Microphone arrays; Particle filters; Speech recognition; automatic speech recognition; beamforming; joint denoising and dereverberation; microphone arrays; multi-step linear prediction; particle filter; speech feature enhancement;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008
  • Conference_Location
    Trento
  • Print_ISBN
    978-1-4244-2337-8
  • Electronic_ISBN
    978-1-4244-2338-5
  • Type

    conf

  • DOI
    10.1109/HSCMA.2008.4538699
  • Filename
    4538699