DocumentCode :
2182680
Title :
Speech processing with a cortical representation of audio
Author :
Mesgarani, Nima ; Shamma, Shihab
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5872
Lastpage :
5875
Abstract :
Neurophysiological studies in the primary auditory cortex have recently demonstrated a rich diversity of responses that provide an explicit multidimensional representation of phonemic acoustic features (Mesgarani 2008). Specifically, distinct subsets of cortical neurons are activated by articulatory gestures and dynamics that are characteristic of different phonemes. Here we use a computational cortical model to illustrate how these phonetic features appear in such a multiresolution representation. We also review how this representation has been successfully applied in variety of speech processing tasks including robust speech discrimination, speech enhancement and phoneme recognition.
Keywords :
speech enhancement; speech recognition; audio cortical representation; multiresolution representation; neurophysiological studies; phoneme recognition; speech enhancement; speech processing; Gabor filters; Modulation; Spectrogram; Speech; Speech enhancement; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947697
Filename :
5947697
Link To Document :
بازگشت