Title :
Speech processing with a cortical representation of audio
Author :
Mesgarani, Nima ; Shamma, Shihab
Abstract :
Neurophysiological studies in the primary auditory cortex have recently demonstrated a rich diversity of responses that provide an explicit multidimensional representation of phonemic acoustic features (Mesgarani 2008). Specifically, distinct subsets of cortical neurons are activated by articulatory gestures and dynamics that are characteristic of different phonemes. Here we use a computational cortical model to illustrate how these phonetic features appear in such a multiresolution representation. We also review how this representation has been successfully applied in a variety of speech processing tasks, including robust speech discrimination, speech enhancement, and phoneme recognition.
Keywords :
speech enhancement; speech recognition; audio cortical representation; multiresolution representation; neurophysiological studies; phoneme recognition; speech processing; Gabor filters; Modulation; Spectrogram; Speech;
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947697