DocumentCode
178384
Title
Subband hybrid feature for multi-stream speech recognition
Author
Feipeng Li
Author_Institution
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
fYear
2014
fDate
4-9 May 2014
Firstpage
2484
Lastpage
2488
Abstract
A subband hybrid (SBH) feature is developed for multi-stream (MS) speech recognition. The fullband speech signal is decomposed into multiple subbands, each covers about 3 Bark along the frequency. Speech signal is analyzed by a high-resolution filterbank of 4 filters/Bark and a low-resolution filterbank of 2 filters/Bark to facilitate the representation of both short-term spectral modulation and long-term temporal modulation within a frequency subband. Experiments on TIMIT corpus for English and RATS corpus for Arabic Levantine show that the SBH feature significantly enhances the amount of information being extracted from individual subbands. The MS system with performance monitor achieves a substantial gain in performance over the single-stream baseline.
Keywords
channel bank filters; natural language processing; signal resolution; speech recognition; MS speech recognition; RATS corpus; SBH feature; TIMIT corpus; fullband speech signal; high-resolution filterbank; long-term temporal modulation; low-resolution filterbank; multistream speech recognition; short-term spectral modulation; subband hybrid feature; Frequency modulation; Monitoring; Noise; Rats; Speech; Speech recognition; multi-stream speech recognition; noise robustness; subband feature;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854047
Filename
6854047
Link To Document