• DocumentCode
    1721782
  • Title

    A Multichannel Noise Reduction Front-End Based on Psychoacoustics for Robust Speech Recognition in Highly Noisy Environments

  • Author

    Cifani, Simone ; Principi, Emanuele ; Rocchi, Cesare ; Squartini, Stefano ; Piazza, Francesco

  • Author_Institution
    3MediaLabs, Univ. Politec. delle Marche, Ancona
  • fYear
    2008
  • Firstpage
    172
  • Lastpage
    175
  • Abstract
    Microphone array systems, due to their spatial filtering capability, usually overcome the traditional mono approaches in noise reduction. Moreover, the employment of psychoacoustically motivated speech enhancement schemes typically allows to achieve a good balance between noise reduction and speech distortion. This drove some of the authors to merge the two advantageous aspects into a unique solution, allowing to achieve relevant performances in terms of enhanced speech quality in a wide range of operating conditions. Now, in this paper, the objective is assessing the effectiveness of the approach when applied as Noise Reduction Front-end to an Automatic Speech Recognition system working in adverse acoustic environments. Some computer simulations have been carried out and they show that a significant improvement of recognition rate is registered when such front-end is used, also w.r.t. the performances achievable when another Multichannel Noise Reduction architecture, not based on psychoacoustics concepts, is adopted on purpose.
  • Keywords
    distortion; microphone arrays; signal denoising; speech recognition; automatic speech recognition system; microphone array systems; multichannel noise reduction front-end; noisy environments; psychoacoustics; spatial filtering capability; speech distortion; speech enhancement schemes; speech recognition; Employment; Filtering; MONOS devices; Microphone arrays; Noise reduction; Noise robustness; Psychoacoustics; Speech enhancement; Speech recognition; Working environment noise; Automatic Speech Recognition; Multichannel Noise Reduction Front-end; Psychoacoustics; Sphinx-4 open source ASR;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008
  • Conference_Location
    Trento
  • Print_ISBN
    978-1-4244-2337-8
  • Electronic_ISBN
    978-1-4244-2338-5
  • Type

    conf

  • DOI
    10.1109/HSCMA.2008.4538714
  • Filename
    4538714