Title :
Prosody-Preserving Voice Transformation to Evaluate Brain Representations of Speech Sounds
Author :
Bedenbaugh, Purvis ; Sarko, Diana K. ; Roth, Heidi L. ; Martin, Eugene M.
Author_Institution :
Dept. of Eng., East Carolina Univ., Greenville, NC, USA
fDate :
7/1/2010
Abstract :
This study employs a voice transformation to overcome the limitations of brain mapping in studying brain representations of natural sounds such as speech. Brain mapping studies of natural sound representations, which present a fixed sound to many neurons with different acoustic frequency selectivity, are difficult to interpret because individual neurons exhibit considerable unexplained variability in the dynamics of their evoked responses. The new approach samples how a single recording responds to an ensemble of sounds, rather than sampling an ensemble of neuronal recordings. A noise-excited filter-bank analysis and resynthesis vocoder systematically shifts the frequency band occupied by the sounds in the ensemble. The quality of the voice transformation is assessed by determining how many bands the filter bank must have to support identification of emotional prosody. Perceptual data show that emotional prosody can be recognized within normal limits if the bandwidth of the filter-bank channels is less than or equal to the bandwidth of perceptual auditory filters. Example physiological data show that stationary linear transfer functions cannot fully explain the responses of central auditory neurons to speech sounds, and that deviations from model predictions are not random. These deviations may be related to acoustic or articulatory features of speech.
Keywords :
biomedical communication; brain; channel bank filters; speech coding; vocoders; acoustic frequency selectivity; bioelectric potentials; brain mapping; brain representations; emotional prosody; nervous system; neuronal recordings; noise excited filter-bank analysis vocoder; noise excited filter-bank resynthesis vocoder; prosody-preserving voice transformation; speech analysis; speech intelligibility; speech sounds; Auditory system; identification; speech processing;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2009.2035165