DocumentCode
3746575
Title
Auditory features for the close talk speech enhancement with parameter masks
Author
Yi Jiang;Yuanyuan Zu;Runsheng Liu
Author_Institution
The Quartermaster Equipment Research Institute, CPLA Beijing, P.R. China
fYear
2015
Firstpage
1194
Lastpage
1198
Abstract
The speech segregation and enhancement is a hard task in speech communication. In order to get the clean target speech, a close talk system is used to collect the speech with a nearby microphone. A deep neural networks (DNN) estimator is used in a frequency channel for speech energy calculation with parameter masks. The adjusted binaural auditory features are used as the main input for DNN speech energy estimation. The energy difference between the two microphones is used as the main binaural auditory feature. The time difference is also used as the comparison feature. Experiments show the energy difference feature can get the similar performance to the combination two microphones monaural and binaural auditory features with limited calculation complexity. The two microphones energy difference feature is one of the key features in close talk speech enhancement.
Keywords
"Speech","Filter banks","Microphones","Feature extraction","Ear","Speech enhancement"
Publisher
ieee
Conference_Titel
Image and Signal Processing (CISP), 2015 8th International Congress on
Type
conf
DOI
10.1109/CISP.2015.7408062
Filename
7408062
Link To Document