DocumentCode
117944
Title
Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features
Author
Kitaoka, Norihide ; Hayashi, Tomoki ; Takeda, Kazuya
Author_Institution
Nagoya Unviersity, Nagoya, Japan
fYear
2014
fDate
9-12 Dec. 2014
Firstpage
1
Lastpage
5
Abstract
In this study, we investigate the effect of blind spatial subtraction arrays (BSSA) on speech recognition systems by comparing the performance of a method using Mel-Frequency Cepstral Coefficients (MFCCs) with a method using Deep Bottleneck Features (DBNF) based on Deep Neural Networks (DNN). Performance is evaluated under various conditions, including noisy, in-vehicle conditions. Although performance of the DBNF-based system was much more degraded by noise than the MFCC-based system, BSSA improved the performance of both methods greatly, especially when matched condition training of acoustic models was employed. These results show the effectiveness of BSSA for speech recognition.
Keywords
acoustic signal processing; cepstral analysis; feature extraction; neural nets; speech recognition; BSSA; DBNF; DNN; MFCC; acoustic model training; blind spatial subtraction array technique; deep bottleneck feature; deep neural network; in-vehicle condition; mel-frequency cepstral coefficient; noisy speech recognition; Feature extraction; Hidden Markov models; Noise; Speech; Speech enhancement; Speech recognition; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
Conference_Location
Siem Reap
Type
conf
DOI
10.1109/APSIPA.2014.7041556
Filename
7041556
Link To Document