Title :
Improving mobile phone based query recognition with a microphone array
Author :
Venkataramani, Swagath ; Velmurugan, R. ; Rao, Prahlada
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Bombay, Mumbai, India
fDate :
Feb. 28 2014-March 2 2014
Abstract :
With mobile phone penetration high and growing rapidly, speech based access to information is an attractive proposition. However, automatic speech recognition (ASR) performance is seriously compromised in real-world scenarios where background acoustic noise is omnipresent. Speech enhancement methods can help to improve the signal quality presented to the automatic speech recognition at the receiving end. These methods typically exploit spectral diversity to achieve separation of speech from noise. While this works for most background noise, it fails for noise arising from speech sources such as interfering speakers in the vicinity of the caller. In this paper, we investigate the potential advantages of generating spatial cues via stereo microphones on the mobile phone handset to enhance speech. Such, enhancement of foreground speech can be done using blind source separation (BSS). This, when applied to the stereo mixtures before transmission is shown to achieve a significant improvement in ASR accuracy in the context of a mobile-phone based agricultural information access system.
Keywords :
acoustic noise; blind source separation; microphone arrays; mobile handsets; speech enhancement; speech recognition; ASR; BSS; automatic speech recognition; background acoustic noise; blind source separation; microphone array; mobile phone handset; query recognition; signal quality; speech based access; speech enhancement; speech sources; stereo microphones; Accuracy; Microphones; Noise; Source separation; Speech; Speech enhancement; Speech recognition;
Conference_Titel :
Communications (NCC), 2014 Twentieth National Conference on
Conference_Location :
Kanpur
DOI :
10.1109/NCC.2014.6811299