• DocumentCode
    542320
  • Title

    Hands-free continuous speech recognition in noise using a speaker beam-former based on spectrum-entropy

  • Author

    George, Nokas ; Evangelos, Dermatas

  • Author_Institution
    Department of Electrical & Computer Engineering, University of Patras, 26500, Hellas, Greece
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    Detection of the speaker position is a crucial task in hands-free speech recognition applications. In this paper we present a novel speech beam-former for noisy environments. Initially, the localization algorithm extracts a set of candidate directions of the signal sources using array signal processing methods in the frequency domain. Then, a minimum variance (MV) beam-former identifies the speech signal in the direction where the signal´s spectrum entropy is minimized. The proposed method is evaluated by a phoneme recognition system using noise recordings from an air-condition fan and the TIMIT speech corpus. Extended experiments, carried out in the range of 25–0 dB, show almost perfect estimation of the speaker DOA in all cases. As a consequence, the recognition rate increases significantly compared to the rate obtained by a single microphone. The recognition improvement increases especially in very low SNRs.
  • Keywords
    Arrays; Entropy; Hidden Markov models; Robustness; Speech; Speech recognition; Three dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743882
  • Filename
    5743882