• DocumentCode
    2926422
  • Title

    Microphone array based speech recognition with different talker-array positions

  • Author

    Omologo, Maurizio ; Matassoni, Marco ; Svaizer, Piergiorgio ; Giuliani, Diego

  • Author_Institution
    IRST-Istituto per la Ricerca Sci. e Tecnologica, Povo di Trento, Italy
  • Volume
    1
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    227
  • Abstract
    The use of a microphone array for hands-free continuous speech recognition in noisy and reverberant environments is investigated. An array of eight omnidirectional microphones was placed at different angles and distances from the talker. A time delay compensation module was used to provide a beamformed signal as input to a hidden Markov model (HMM) based recognizer. A phone HMM adaptation, based on a small amount of phonetically rich sentences, further improved the recognition rate obtained by applying only beamforming. These results were confirmed both by experiments conducted in a noisy and reverberant environment and by simulations. In the latter case, different conditions were recreated by using the image method to reproduce synthetic versions of the array microphone signals.
  • Keywords
    acoustic noise; acoustic signal processing; acoustic transducer arrays; array signal processing; compensation; delays; hidden Markov models; microphones; reverberation; speech recognition; array microphone signals; beamformed signal; hands-free continuous speech recognition; hidden Markov model based recognizer; image method; microphone array based speech recognition; noisy environment; omnidirectional microphones; phone HMM adaptation; phonetically rich sentences; reverberant environment; talker-array positions; time delay compensation module; Array signal processing; Delay effects; Hidden Markov models; Image reconstruction; Laboratories; Microphone arrays; Predictive models; Speech recognition; Switches; Working environment noise;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.599610
  • Filename
    599610